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BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates to novel proteins that inhibit human 
neutrophil elastase (hNE) . A large fraction of the sequence 
of each of these proteins is identical to a known human 
protein which has very little or no inhibitory activity with 
respect to hNE. 

Information Disclosure Statement 

!• hNE , its natural inhibitors, and pathologies 
Human Neutrophil Elastase (hNE, also known as Human 
Leukocyte Elastase (hLE); EC 3.4.21.11) is a 29 Kd protease 
with a wide spectrum of activity against extracellular 
matrix components ( CAMP 8 2 , CAMP88, MCWH8 9) . The enzyme is 
one of the major neutral proteases of the azurophil granules 
of polymorphonuclear leucocytes and is involved in the 
elimination of pathogens and in connective tissue 
restructuring (TRAV88) . In cases of hereditary reduction of 
the circulating a-l-protease inhibitor (API, formerly known 
as cxl antitrypsin) , the principal systemic physiological 
inhibitor of hNE (HEID86), or the inactivation of API by 
oxidation ("smoker's emphysema"), extensive destruction of 
lung tissue may result from uncontrolled elastolytic 
activity of hNE (CANT 8 9 ) . Several human respiratory 
disorders, including cystic fibrosis and emphysema, are 
characterized by an increased neutrophil burden on the 
epithelial surface of the lungs (SNID91, MCEL91, GOLD86) and 
hNE release by neutrophils is implicated in the progress of 
these disorders (MCEL91, WEIS89) . A preliminary study of 
aerosol administration of API to cystic fibrosis patients 
indicates that such treatment can be effective both in 
prevention of respiratory tissue damage and in augmentation 
of host antimicrobial defenses (MCEL91) . 

API presents some practical problems to large-scale 
routine use as a pulmonary anti-elas tolytic agent. These 



include the relatively large size of the molecule (394 
residues, 51 k Dalton) , the lack of intramolecular 
stabilizing disulfide bridges, and specific post 
translational modifications of the protein by glycosylation 
at three sites. Perhaps of even greater importance is the 
sensitivity of API to oxidation, such as those released by 
activated neutrophils. Hence a small stable nontoxic highly 
efficacious inhibitor of hNE would be of great therapeutic 
value . 

2. Proteinaceous Serine Protease Inhibitors . A large 
number of proteins act as serine protease inhibitors by 
serving as a highly specific, limited proteolysis substrate 
for their target enzymes. In many cases, the reactive site 
peptide bond ("scissile bond") is encompassed in at least 
one disulfide loop, which insures that during conversion of 
virgin to modified inhibitor the two peptide chains cannot 
dissociate . 

A special nomenclature has evolved for describing the 
active site of the inhibitor. Starting at the residue on 
the amino side of the scissile bond, and moving away from 
the bond, residues are named PI, P2, P3, etc. (SCHE67) . 
Residues that follow the scissile bond are called PI', P2 ' , 
P3', etc. It has been found that the main chain of protein 
inhibitors having very different overall structure are 
highly similar in the region between P3 and P3 ' with 
especially high similarity for P2, P 1 and PI' (LASK80 and 
works cited therein) . It is generally accepted that each 
serine protease has sites SI, S2, etc. that receive the side 
groups of residues PI, P2, etc. of the substrate or 
inhibitor and sites SI', S2 ' , etc. that receive the side 
groups of PI', P2 ' , etc. of the substrate or inhibitor 
(SCHE67) . It is the interactions between the S sites and 
the P side groups that give the protease specificity with 
respect to substrates and the inhibitors specificity with 
respect to proteases. 

The serine protease inhibitors have been grouped into 
families according to both sequence similarity and the 
topological relationship of their active site and disulfide 



loops. The families include the bovine pancreatic trypsin 
inhibitor (Kunitz) , pancreatic secretory trypsin inhibitor 
(Kazal), the Bowman-Birk inhibitor, and soybean trypsin 
inhibitor (Kunitz) families. Some inhibitors have several 
reactive sites on a single polypeptide chains, and these 
distinct domains may have different sequences, 
specificities, and even topologies. 

One of the more unusual characteristics of these 
inhibitors is their ability to retain some form of 
inhibitory activity even after replacement of the PI 
residue. It has further been found that substituting amino 
acids in the P 5 to P 5 ' region, and more particularly the P3 
to P3' region, can greatly influence the specificity of an 
inhibitor. LASK80 suggested that among the BPTI (Kunitz) 
family, inhibitors with PI Lys and Arg tend to inhibit 
trypsin, those with Pl=Tyr, Phe, Trp, Leu and Met tend to 
inhibit chymotrypsin, and those with Pl=Ala or Ser are 
likely to inhibit elastase. Among the Kazal inhibitors, 
they continue, inhibitors with PI = Leu or Met are strong 
inhibitors of elastase, and in the Bowman-Kirk family 
elastase is inhibited with PI Ala, but not with PI Leu. 

"Kunitz" Domain Proteinase Inhibitors . Bovine 
pancreatic trypsin inhibitor (BPTI, a.k.a. aprotonin) is a 
58 a. a. serine proteinase inhibitor of the BPTI (Kunitz) 
domain (KuDom) family. Under the tradename TRASYLOL, it is 
used for countering the effects of trypsin released during 
pancreatitis. Not only is its 58 amino acid sequence known, 
the 3D structure of BPTI has been determined at high 
resolution by X-ray diffraction (HUBE77, MARQ83, WLOD84, 
WLOD87a, WLOD87b) , neutron diffraction (WLOD84), and by NMR 
(WAGN87) . One of the X-ray structures is deposited in the 
Brookhaven Protein Data Bank as "6PTI" [sic] . The 3D 
structure of various BPTI homologues (EIGE90, HYNE90) are 
also known. At least sixty homologues have been reported; 
the sequences of 39 homologues are given in Table 13, and 
the amino acid types appearing at each position are compiled 
in Table 15. The known human homologues include domains of 
Lipoprotein Associated Coagulation Inhibitor (LACI) (WUNT88 , 



GIRA89), Inter-a-Trypsin Inhibitor (ALBR83a, ALBR83b, 
DIAR90, ENGH8 9 , TRIB86, GEBH8 6 , GEBH90 , KAUM8 6, ODOM90, 
SALI90), and the Alzheimer bet a -Amyloid Precursor Protein. 
Circularized BPTI and circularly permuted BPTI have binding 
properties similar to BPTI (GOLD83) . Some proteins 
homologous to BPTI have more or fewer residues at either 
terminus . 

In BPTI, the PI residue is at position 15. Tschesche 
et al. (TSCH87) reported on the binding of several BPTI PI 
derivatives to various proteases: 

Dissociation constants for BPTI PI derivatives, Molar. 

Residue Trypsin Chymotrypsin Elastase Elastase 

#15 (bovine (bovine (porcine (human 

£1 Pancreas) pancreas) pancreas) leukocytes) 

lysine 6.0-1CT 14 9.0-10" 9 - 3.5-10~ 6 (WT) 

glycine - + 7.0-10" 9 

alanine + - 2.8-10" 8 2.5-10" 9 



valine 



5.7-10" 8 1.1-lCT 10 



O 20 leucine - - 1.9-10" 8 2 



9-lCT ! 



From the report of Tschesche et al. we infer that 
molecular pairs marked " + " have K d s > 3.5-10" 6 M and that 
molecular pairs marked "-" have K d s » 3.5-10" 6 M. It is 
apparent that wild-type BPTI has only modest affinity for 
hNE, however, mutants of BPTI with higher affinity are 
known. While not shown in the Table, BPTI does not 
significantly bind hCG. However, Brinkmann and Tschesche 
(BRIN90) made a triple mutant of BPTI [viz. K15F, R17F, 
M52E) that has a K ± with respect to hCG of 5.0 x 10" 7 M. 



3 " ITI domain 1 and ITI do m ain 2 as an initial protein 
binding domains (IPBD) 

Many mammalian species have a protein in their plasma that 
can be identified, by sequence homology and similarity of 
physical and chemical properties, as inter-a-trypsin 
inhibitor (ITI), a large (M r ca 240,000) circulating protea; 
inhibitor (for recent reviews see ODOM90, SALI90, GEBH90, 



GEBH8 6) . The sequence of human ITI is shown in Table 400. 
The intact inhibitor is a glycoprotein and is currently 
believed to consist of three glycosylated subunits that 
interact through a strong glycosaminoglycan linkage (ODOM90, 
SALI90, ENGH89, SELL87) . The anti-trypsin activity of ITI 
is located on the smallest subunit (ITI light chain, 
unglycosylated M r ca 15,000) which is identical in amino acid 
sequence to an acid stable inhibitor found in urine (UTI) 
and serum (STI) (GEBH86, GEBH90) . The amino-acid sequence 
of the ITI light chain is shown in Table 400. The mature 
light chain consists of a 21 residue N-terminal sequence, 
glycosylated at Ser 10 , followed by two tandem Kunitz-type 
domains the first of which is glycosylated at Asn 45 (ODOM90) . 
In the human protein, the second Kunitz-type domain has been 
shown to inhibit trypsin, chymotrypsin, and plasmin 
(ALBR8 3a , ALBR83b, SELL87, SWAI88) . The first domain lacks 
these activities but has been reported to inhibit leukocyte 
elastase («1 uM>K ± >«l nM) (ALBR8 3a , b , ODOM90) . cDNA encoding 
the ITI light chain also codes for a-l-microglobulin 
(TRAB86, KAUM86, DIAR90); the proteins are separated post- 
translationally by proteolysis. 

The two Kunitz domains of the ITI light chain (ITI-D1 and 
ITI-D2) possesses a number of characteristics that make them 
useful as Initial Potential Binding Domains (IPBDs) . ITI-D1 
comprises at least residues 26 to 76 of the UTI sequence 
shown in Fig. 1 of GEBH8 6. The Kunitz domain could be 
thought of as comprising residues from as early as residue 
22 to as far as residue 79. Residues 22 through 79 
constitute a 58-amino-acid domain having the same length as 
bovine pancreatic trypsin inhibitor (BPTI) and having the 
cysteines aligned. ITI-D2 comprises at least residues 82 
through 132; residues as early as 78 and as later as 135 
could be included to give domains closer to the classical 
58-amino-acid length. As the space between the last 
cysteine of ITI-D1 (residue 76 of ITI light chain) and the 
first cysteine of ITI-D2 (residue 82 of ITI light chain) is 
only 5 residues, one can not assign 58 amino acids to each 
domain without some overlap. Unless otherwise stated, 



herein, we have taken the second domain to begin at residue 
78 of the ITI light chain. Each of the domains are highly 
homologous to both BPTI and the EpiNE series of proteins 
described in US patent 5,223,409. Although x-ray structures 
of the isolated domains ITT-D1 and ITI-D2 are not available, 
crystallographic studies of the related Kunitz-type domain 
isolated from the Alzheimer's amyloid 3-protein (AA(3P) 
precursor show that this polypeptide assumes a 3D structure 
almost identical to that of BPTI (HYNE90) . 

The three-dimensional structure of a-dendrotoxin from 
green mamba venom has been determined (SKAR92) and the 
structure is highly similar to that of BPTI. The author 
states, "Although the main-chain fold of a-DTX is similar 
to that of homologous bovine pancreatic trypsin inhibitor 
(BPTI), there are significant differences involving segments 
of the polypeptide chain close to the ' antiprotease site' of 
BPTI. Comparison of the structure of a-DTX with the 
existing models of BPTI and its complexes with trypsin and 
kallikrein reveals structural differences that explain the 
inability of a-DTX to inhibit trypsin and chymotrypsin . " 

The structure of the black mamba K venom has been 
determined by NMR spectroscopy and has a 3D structure that 
is highly similar to that of BPTI despite 32 amino-acid 
sequence differences between residues 5 and 55 (the first 
and last cysteines) (BERN93) . "The solution structure of 
Toxin K is very similar to the solution structure of the 
basic pancreatic trypsin inhibitor (BPTI) and the X-ray 
crystal structure of the a-dendrotoxin from Dendroaspis 
angusticeps (a-DTX), with r.m.s.d. values of 1.31 A and 0.92 
A, respectively, for the backbone atoms of residues 2 to 56. 
Some local structural differences between Toxin K and BPTI 
are directly related to the fact that intermolecular 
interactions with two of the four internal molecules of 
hydration water in BPTI are replaced by intramolecular 
hydrogen bonds in Toxin K . " Thus, it is likely that the 
solution 3D structure of either of the isolated ITI-D1 
domain or of the isolated ITI-D2 domain will be highly 
similar to the structures of BPTI, AAf3P, and black mamba K 



venom. In this case, the advantages described previously 
for use of BPTI as an IPBD apply to ITI-D1 and to ITI-D2. 
ITI-D1 and ITI-D2 provide additional advantages as an IPBD 
for the development of specific anti-elastase inhibitory 
activity. First, the ITI-D1 domain has been reported to 
inhibit both leukocyte elastase ( ALBR8 3a , b, ODOM90) and 
Cathepsin-G (SWAI88, ODOM90); activities which BPTI lacks. 
Second, ITI-D1 lacks affinity for the related serine 
proteases trypsin, chymotrypsin, and plasmin ( ALBR8 3a , b, 
SWAI88), an advantage for the development of specificity in 
inhibition. ITI-D2 has the advantage of not being 
glycosylated. Additionally, ITI-D1 and ITI-D2 are human- 
derived polypeptides so that derivatives are anticipated to 
show minimal antigenicity in clinical applications. 

4. Secretion of heterologous proteins from Pichia vastoris 
Others have produced a number of proteins in the yeast 
Pichia pastoris. For example, Vedvick et al . ( VEDV91 ) and 
Wagner et al. (WAGN92) produced aprotinin from the alcohol 
oxidase promoter with induction by methanol as a secreted 
protein in the culture medium (CM) at ~1 mg/mL. Gregg et 
al. (GREG93) have reviewed production of a number of 
proteins in P. pastoris. Table 1 of GREG93 shows proteins 
that have been produced in P. pastoris and the yields. 

5. Rec ombinant production of Kunitz Domains: 
Aprotinin has been made via recombinant-DNA technology 
(AUER87, AUER88, AUER89, AUER90, BRIN90, BRIN91, ALTM91) . 

6. Construction methods: 

Unless otherwise stated, genetic constructions and other 
manipulations are carries out by standard methods, such as 
found in standard references (e.g. AUSU87 and SAMB89). 

No admission is made that any cited reference is prior 
art or pertinent prior art, and the dates given are those 
appearing on the reference and may not be identical to the 
actual publication date. The descriptions of the teachings 



of any cited reference are based on our present reading 
thereof, and we reserve the right to revise the description 
if an error comes to our attention, and to challenge whether 
the description accurately reflects the actual work 
reported. We reserve the right to challenge the 
interpretation of cited works, particularly in light of new 
or contradictory evidence. 
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SUMMARY OF THE INVENTION 

The present invention describes a series of small potent 
proteinaceous inhibitors of human neutrophil elastase (hNE) . 
One group of inhibitors is derived from a Kunitz-type 
inhibitory domain found in a protein of human origin, 
namely, the light chain of human Inter-a-trypsin inhibitor 
(ITI) which contains domains designated ITI-D1 and ITI-D2. 
The present invention discloses variants of ITI-D1 and ITI- 
D2 that have very high affinity for hNE. The present 
invention comprises modifications to the ITI-D2 sequence 
that facilitate its production in the yeast Pichia pastoris 
and that are highly potent inhibitors of hNE. The invention 
also relates to methods of transferring segments of sequence 
from one Kunitz domain to another and to methods of 
production. 

The invention is presented as a series of examples that 
describe design, production, and testing of actual 
inhibitors and additional examples describing how other 
inhibitors could be discovered. The invention relates to 
proteins that inhibit human neutrophil elastase (hNE) with 
high affinity. 

NOMENCLATURE and ABBREVIATIONS 



Term Meaning 

x;; y Fusion of gene x to gene y in frame. 

X::Y Fusion protein expressed from x::y fusion gene. 

UM Micromolar, 10~ 5 molar. 

nM Namomolar, 10" 9 molar. 

pM Picomolar, 1CT 12 molar. 



Single- 


letter amino- 


•acid 


codes : 




A: Ala 


C: 


Cys 


D 


: Asp 


E: 


Glu 


F: Phe 


G: 


Gly 


H 


: His 


I: 


He 


K: Lys 


L: 


Leu 


M 


: Met 


N : 


Asn 


P: Pro 




Q: 


Gin 


R: 


Arg 


S 


T: Thr 


V: 


Val 


W: 


: Trp 


Y: 


Tyr 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS OF THE 
INVENTION 

A protein sequence can be called an "aprotinin-like 
Kunitz domain" if it contains a sequence that when aligned 
to minimize mismatches, can be aligned, with four or fewer 
mismatches, to the pattern: 

Cys- (Xaa) 6 -Gly-Xaa-Cys- (Xaa) 8 - [Tyr | Phe] - (Xaa) 6 -Cys- (Xaa) 2 -Phe- 
Xaa- [Tyr | Trp | Phe] -Xaa-Gly-Cys- (Xaa) 4 - [Asn ] Gly] -Xaa- [Phe | Tyr] - 
(Xaa) 5 -Cys- (Xaa) 3-Cys (SEQ ID NO: 86), where bracketed amino 
acids separated by a | symbol are alternative amino acids 
for a single position. For example, [Tyr | Phe] indicates 
that at that position, the amino acid may be either Tyr or 
Phe. The symbol Xaa denotes that at that position, any 
amino acid may be used. For the above test, an insertion or 
deletion counts as one mismatch. 

In aprotonin, the cysteines are numbered 5, 14, 30, 38, 
51, and 55 and are joined by disulfides 5-to-55, 14-to-38, 
and 30-to-51. Residue 15 is called the PI residue (SCHE67); 
residues toward the amino terminus are called P2 (residue 
14), P3 (residue 13), etc. Residue 16 is called PI', 17 is 
P2 ' , 18 is P3 ' , etc. 

There are many homologues of aprotonin, which differ from it 
at one or more positions but retain the fundamental 
structure defined above. For a given list of homologues, it 
is possible to tabulate the frequency of occurrence of each 
amino acid at each ambiguous position. (The sequence having 
the most prevalent amino acid at each ambiguous position is 
listed as "Consensus Kunitz Domain" in Table 100) . 

A "human aprotonin-like Kunitz domain" is an aprotonin- 
like Kunitz domain which is found in nature in a human 
protein. Human aprotonin-like Kunitz domains include, but 
are not limited to, ITI-D1, ITI-D2, App-I, TFPI2-D1, TFPI2- 
D2, TFPI2-D3, LACI-D1, LACI-D2, LACI-D3, A3 collagen, and 
the HKI B9 domain. In this list, Dl, D2 , etc., denote the 
first, second, etc. domain of the indicated multidomain 
protein. 

"Weak", "Moderate", "Strong" and "Very Strong" binding to 
and inhibition of hNE are defined in accordance with Table 
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55. Preferably, the proteins of the present invention have 
a Ki of less than 1000 pM (i.e., are "strong" inhibitors), 
more preferably less than 50 pM, most preferably less than 
10 pM (i.e., are "very strong" inhibitors). 

For purposes of the present invention, an aprotonin-like 
Kunitz domain may be divided into ten segments, based on the 
consensus sequence and the location of the catalytic site. 
Using the amino acid numbering scheme of aprotonin, these 
segments are as follows (see Table 100) : 
1: 1-4 (residues before first Cys) 

2: 5-9 (first Cys and subsequent residues before P6) 

3: 10-13 (P6 to P3) 

4: 14 (second Cys; P2) 

5: 15-21 (PI, and PI' to P6') 

6: 22-30 (after P6 and up to and incl . third Cys.) 

7: 31-36 (after third Cys and up to consensus Gly-Cys) 

8: 37-38 (consensus Gly-Cys) 

9: 39-42 (residues after Gly-Cys and before consensus 
[Asnf Gly] 

10: 43-55 (up to last Cys) (also includes residues after last 
Cys, if any) 

It will be appreciated that in those aprotonin-like 
Kunitz domains that differ from aprotonin by one or more 
amino acid insertions or deletions, or which have a 
different number of amino acids before the first cysteine or 
after the last cysteine, the actual amino acid position may 
differ from that given above. It is applicant's intent that 
these domains be numbered so as to correspond to the aligned 
aprotonin sequence, e.g., the first cysteine of the domain 
is numbered amino acid 5, for the purpose of segment 
identification. Note that segment 1, while a part of 
aprotonin, is not a part of the formal definition of an 
aprotonin-like Kunitz domain, and therefore it is not 
required that the proteins of the present invention include 
a sequence corresponding to segment 1. Similarly, part of 
segment 10 (after the last Cys) is not a required part of 
the domain. 

A "humanized inhibitor" is one in which at least one of 



13 

segments 3, 5, 7 and 9 differs by at least one 

nonconservative modification from the most similar (based on 
amino acid identities) human aprotonin-like Kunitz domain, 
at least one of segments 2, 6, and 10 (considered up to the 
last Cys) is identical, or differs only by conservative 
modifications, from said most similar human aprotonin-like 
Kunitz domain, and which is not- identical to any naturally 
occurring nonhuman aprotonin-like Kunitz domain. (Note that 
segment 1 is ignored in making this determination since it 
is outside the sequence used to define a domain, and 
segments 4 and 8 are ignored because they are required by 
the definition of an aprotonin-like Kunitz domain.) 

The proteins of the present invention are preferably 
humanized strong or very strong hNE inhibitors. It should 
be noted that the human aprotonin-like Kunitz domains thus 
far identified are merely weak hNE inhibitors. 

For the purpose of the appended claims, an aprotonin-like 
Kunitz domain is "substantially homologous" to a reference 
domain if, over the critical region (aprotonin residues 5- 
55) set forth above, it is at least at least 50% identical 
in amino acid sequence to the corresponding sequence of or 
within the reference domain, and all divergences take the 
form of conservative and/or semi-conservative modifications. 

Proteins of the present invention include those 
comprising a Kunitz domain that is substantially homologous 
to the reference proteins EPI-HNE-3, EPI-HNE-4, DPI. 1.1, 
DPI. 1.2, DPI. 1.3, DPI. 2.1, DPI. 2. 2, DPI. 2. 3, DPI. 3.1, 
DPI. 3. 2, DPI. 3. 3, DPI .4.1, DPI. 4. 2, DPI .4.3, DPI. 5.1, 
DPI. 5. 2, DPI. 5. 3, DPI. 6.1, DPI. 6.2, DPI. 6. 3, DPI. 6. 4, 
DPI. 6.5, DPI. 6. 6, DPI. 6.7, DPI. 7.1, DPI. 7. 2, DPI. 7. 3, 
DPI. 7. 4, DPI. 7. 5, DPI. 8.1, DPI. 8. 2, DPI. 8. 3, DPI. 9.1, 
DPI. 9. 2, or DPI. 9. 3, as defined in Table 100. Homologues of 
EPI-HNE-3 and EPI-HNE-4 are especially preferred. 

Preferably, the hNE-binding domains of the proteins of 
the present invention are at least 80% identical, more 
preferably, at least 90% identical, in amino acid sequence 
to the corresponding reference sequence. Most preferably, 



the number of mismatches is zero, one, two, three, four or 
five. Desirably, the hNE-binding domains diverge from the 
reference domain solely by one or more conservative 
modifications . 

"Conservative modifications" are defined as: 

a) conservative substitutions of amino acids as 
hereafter defined, and 

b) single or multiple insertions or deletions of amino 
acids at the termini, at interdomain boundaries, in 
loops or in other segments of relatively high 
mobility (as indicated, for example, by high 
temperature factors or lack of resolution in X-ray 
diffraction, neutron diffraction, or NMR) . 
Preferably, except at the termini, no more than 
about five amino acids are inserted or deleted at a 
particular locus, and the modifications are outside 
regions known to contain binding sites important to 
activity . 

"Conservative substitutions" are herein defined as 
exchanges within on of the following five groups: 
I. Small aliphatic, nonpolar or slightly polar 
residues: [Ala, Ser, Thr, (Pro, Gly) ] , 
II. Acidic amino acids and their amides: [Asp, Glu, Asn, 
Gin] , 

III. Polar, positively charged residues: [His, Lys, Arg] , 
IV. Aliphatic nonpolar residues: [Met, Leu, lie, Val, 
(Cys)], and 

V. Large, aromatic residues: [Phe, Tyr, Trp] 
Residues Pro, Gly, and Cys are parenthesized because they 
have special conformational roles. Cys often participates 
in disulfide bonds; when not so doing, it is highly 
hydrophobic. Gly imparts flexibility to the chain; it is 
often described as a "helix breaker" although many a helices 
contain Gly. Pro imparts rigidity to the chain and is also 
described as a "helix breaker". Although Pro is most often 
found in turns, Pro is also found in helices and sheets. 
These residues may be essential at certain positions and 
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substitutable elsewhere. 

Semi-Conservative Modifications" are defined herein as 
transpositions of adjacent amino acids (or their 
conservative replacements), and semi-conservative 
5 substitutions. "Semi-conservative substitutions" are 

defined to be exchanges between two of groups (I)-(V) above 
which are limited either to the supergroup consisting of 
(I), (II), and (III) or to the supergroup consisting of (IV) 
and (V) . For the purpose of this definition, however, 
10 glycine and alanine are considered to be members of both 
supergroups . 

"Non-conservative modifications" are modifications which 
are neither conservative nor semi-conservative. 

Preferred proteins of the present invention are further 
15 characterized by one of more of the preferred, highly 

preferred, or most preferred mutations set forth in Table 
711 . 

Preferably, the proteins of the present invention have 
hNE-inhibitory domains which are not only substantially 

20 homologous to a reference domain, but also qualify as 
humanized inhibitors. 

Claim 1 of PCT/US92/01501 refers to proteins denoted 
EpiNEalpha, EpiNEl, EpiNE2 , EpiNE3, EpiNE4 , EpiNE5, EpiNE6, 
EpiNE7, and EpiNE8 . Claim 3 refers to proteins denoted ITI- 

25 E7, BITI-E7, BITI-E&-1222 , AMIN01, AMIN02, MUTPl, BITI-E7- 
141, MUTT26A, MUTQE, and MUT1619. (With the exception of 
EpiNEalpha, the sequences of all of these domains appears in 
Table 100.) Claims 4-6 related to inhibitors which are 
homologous to, but not identical with, the aforementioned 

30 inhibitors. These homologous inhibitors could differ from 
the lead inhibitors by one or more class A substitutions 
(claim 4), one or more class A or B substitutions (claim 5), 
or one or more class A, B or C substitutions (claim 6) . 
Class A, B and C substitutions were defined in Table 65 of 

35 PCT/US92/01501. For convenience, Table 65 has been 
duplicated in this specification. 

The meaning of classes A, B and C were as follows: A, no 
major effect expected if molecular charge stays in range -1 
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to +1; B, major effects not expected, but more likely than 
with A; and C, residue in binding interface, any change must 
be tested. Each residue position was assigned an A, B, C or 
X rating; X meant no substitution allowed. At the non-X 
positions, allowed substitutions were noted. 

In one series of embodiments, the present invention is 
directed to HNE inhibitors as disclosed in 08/133,031 
(previously incorporated by reference), which is the U.S. 
national stage of PCT/US92/01501 . 

The invention disclosed in 08/133,031 relates to muteins 
of BPTI, ITI-D1 and other Kunitz domain-type inhibitors 
which have a high affinity for elastase. Some of the 
described inhibitors are derived from BPTI and some from 
ITI-D1. However, hybrids of the identified muteins and 
other Kunitz domain-type inhibitors could be constructed. 

For the purpose of simultaneously assessing the affinity 
of a large number of different BPTI and ITI-D1 muteins, DNA 
sequences encoding the BPTI or ITI-DI was incorporated into 
the genome of the bacteriophage M13 . The KuDom is displayed 
on the surface of M13 as an amino-terminal fusion with the 
gene III coat protein. Alterations in the KuDom amino acid 
sequence were introduced. Each pure population of phage 
displaying a particular KuDom was characterized with regard 
to its interactions with immobilized hNE or hCG . Based on 
comparison to the pH elution profiles of phage displaying 
other KuDoms of known affinities for the particular 
protease, mutant KuDoms having high affinity for the target 
proteases were identified. Subsequently, the sequences of 
these mutant KuDoms were determined (typically by sequencing 
the corresponding DNA sequence) . 

Certain aprotonin-like protease inhibitors were shown to 
have a high affinity for HNE (*10 12 /M) . These 58 amino acid 
polypeptides were biologically selected from a library of 
aprotinin mutants produced through synthetic diversity. 
Positions PI, PI', P2 ' , P3 ' , and P4 ' were varied. At Pi, 
only VAL and ILE were selected, although LEU, PHE, and MET 
were allowed by the synthetic conditions. At Pi', ALA and 
GLY were allowed and both were found in proteins having high 
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affinity. (While not explored in the library, many Kazal 
family inhibitors of serine proteases have glutamic or 
aspartic acid at PI'.) All selected proteins contained 
either PHE or MET at P2 ' ; LEU, ILE, and VAL, which are amino 
acids with branched aliphatic side groups, were in the 
library but apparently hinder binding to HNE. Surprisingly, 
position P3' of all proteins selected for high affinity for 
HNE have phenylalanine. No one had suggested that P3 ' was a 
crucial position for determining specificity relative to 
HNE. At P4', SER, PRO, THR, LYS, and GLN were allowed; all 
of these except THR were observed. PRO and SER are found in 
the derivatives having the highest affinity. 

In 08/133,031, Table 61 showed the variability of 39 
naturally-occurring Kunitz domains. All these proteins have 
51 residues in the region C 5 through C 55 ; the total number of 
residues varies due to the proteins having more or fewer 
residues at the termini. Table 62 list the names of the 
proteins that are included in Table 61. Table 64 cites 
works where these sequences are recorded. Table 63 shows a 
histogram of how many loci show a particular variability vs. 
the variability. "Core" refers to residues from 5 to 55 
that show greater sequence and structural similarity than do 
residues outside the core. 

At ten positions a single amino-acid type is observed in 
all 42 cases, these are C 5 , G 12 , C 14 , C 30 , F 33 , G 37 , C 38 , N 43 , C 51 , 
and C 55 . Although there are reports that each of these 
positions may be substituted without complete loss of 
structure, only G 12 , C 14 , G 37 , and C 38 are close enough to the 
binding interface to offer any incentive to make changes. 
G 12 is in a conformation that only glycine can attain; this 
residue is best left as is. Marks et al. (MARK 8 7 ) replaced 
both C 14 and C 38 with either two alanines or two threonines. 
The C 14 /C 38 cystine bridge that Marks et al. removed is the 
one very close to the scissile bond in BPTI; surprisingly, 
both mutant molecules functioned as trypsin inhibitors. 
Both BPTI (C14A,C38A) and BPTI (C14T, C38T ) are stable and 
inhibit trypsin. Altering these residues might give rise to 
a useful inhibitor that retains a useful stability, and the 
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phage-display of a variegated population is the best way to 
obtain and test mutants that embody alterations at either 14 
or 38. Only if the C 14 /C 38 disulfide is removed, would the 
strict conservation of G 37 be removed. 

At seven positions (viz. 23, 35, 36, 40, 41, 45, and 47) 
only two amino-acid types have been found. At position 23 
only Y and F are observed; the para position of the phenyl 
ring is solvent accessible and far from the binding site. 
Changes here are likely to exert subtle influences on 
binding and are not a high priority for variegation. 
Similarly, 35 has only the aromatic residues Y and W; 
phenylalanine would probably function well here. At 36, 
glycine predominates while serine is also seen. Other amino 
acids, especially {N, D, A, R}, should be allowed and would 
likely affect binding properties. Position 40 has only G or 
A; structural models suggest that other amino acids would be 
tolerated, particularly those in the set {S, D, N, E, K, R, 
L, M, Q, and T}. Position 40 is close enough to the binding 
site that alteration here might affect binding. At 41, only 
N, and K have been seen, but any amino acid, other than 
proline, should be allowed. The side group is exposed, so 
hydrophilic side groups are preferred, especially {D, S, T, 
E, R, Q, and A}. This residue is far enough from the 
binding site that changes here are not expected to have big 
effects on binding. At 45, F is highly preferred, but Y is 
observed once. As one edge of the phenyl ring is exposed, 
substitution of other aromatics (W or H) is likely to make 
molecules of similar structure, though it is difficult to 
predict how the stability will be affected. Aliphatics such 
as leucine or methionine (not having branched C p s) might also 
work here. At 47, only S and T have been seen, but other 
amino acids, especially {N, D, G, and A}, should give stable 
proteins . 

At one position (44), only three amino-acid types have 
been observed. Here, asparagine predominates and may form 
internal hydrogen bonds. Other amino acids should be 
allowed, excepting perhaps proline. 

At the remaining 4 0 positions, four or more amino acids 



have been observed; at 28 positions, eight or more amino- 
acid types are seen. Position 25 exhibits 13 different 
types and 5 positions (1, 6, 17, 26, and 34) exhibit 12 
types. Proline (the most rigid amino acid) has been 
observed at fourteen positions: 1, 2, 8, 9, 11, 13, 19, 25, 
32, 34, 39, 49, 57, and 58. The 4>,i|/ angles of BPTI (CREI84, 
Table 6-3, p. 222) indicate that proline should be allowed 
at positions 1, 2, 3, 7, 8, 9, 11, 13, 16, 19, 23, 25, 26, 
32, 35, 36, 40, 42, 43, 48, 49, 50, 52, 53, 54, 56, and 58. 
Proline occurs at four positions (34, 39, 57, and 58) where 
the BPTI angles indicate that it should be unacceptable. 

We conclude that the main chain rearranges locally in these 
cases . 

Based on these data and excluding the six cysteines, we 
judge that the KuDom structure will allow those 
substitutions shown in Table 65. The class indicates 
whether the substitutions: A) are very likely to give a 
stable protein having substantially the same binding to hNE, 
hCG, or some other serine protease as the parental seguence, 
B) are likely to give similar binding as the parent, or C) 
are likely to give a proteins retaining the KuDom structure, 
but which are likely to affect the binding. Mutants in class 
C must be tested for affinity, which is relatively easy 
using a display-phage system, such as the one set forth in 
WO/02809. The affinity of hNE and hCG inhibitors is most 
sensitive to substitutions at positions 15, 16, 17, 18, 34, 
39, 19, 13, 11, 20, 36 of BPTI, if the inhibitor is a mutant 
of ITI-D1, these positions must be converted to their ITI-D1 
equivalents by aligning the cysteines in BPTI and ITI-Dl. 

Wild-type BPTI is not a good inhibitor of hNE. BPTI with 
a single K15L mutation exhibits a moderate affinity for HNE 
(K d - 2.9-10- 9 M) (BECK88b) . However, the amino terminal 
Kunitz domain (BI-8e) of the light chain of bovine inter-a- 
trypsin inhibitor has been generated by proteolysis and 
shown to be a potent inhibitor of HNE (K d = 4 . 4-10" 11 M) 
(ALBR83) . 

It has been proposed that the PI residue is the primary 
determinant of the specificity and potency of BPTI-like 
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molecules (SINH91, BECK8 8b , LASK80 and works cited therein) . 
Although both BI-8e and BPTI (K15L) feature LEU at their 
respective PI positions, there is a 66 fold difference in 
the affinities of these molecules for HNE . We therefore 
hypothesized that other structural features must contribute 
to the affinity of BPTT-like molecules for HNE. 

A comparison of the structures of BI-8e and BPTI (K15L) 
reveals the presence of three positively charged residues at 
positions 39, 41, and 42 of BPTI which are absent in BI-8e. 
These hydrophilic and highly charged residues of BPTI are 
displayed on a loop which underlies the loop containing the 
PI residue and is connected to it via a disulfide bridge. 
Residues within the underlying loop (in particular residue 
39) participate in the interaction of BPTI with the surface 
of trypsin (BLOW72) and may contribute significantly to the 
tenacious binding of BPTI to trypsin. These hydrophilic 
residues might, however, hamper the docking of BPTI variants 
with HNE. Supporting this hypothesis, BI-8e displays a high 
affinity for HNE and contains no charged residues in 
residues 39-42. Hence, residues 39 through 42 of wild type 
BPTI were replaced with the corresponding residues (MGNG) of 
the human homologue of BI-8e. As we anticipated, a 
BPTI(K15L) derivative containing the MGNG 39-42 substitution 
exhibited a higher affinity for HNE than did the single 
substitution mutant BPTI (K15L) . Mutants of BPTI with Met 
at position 39 are known, but positions 40-42 were not 
mutated simultaneously. 

Tables 207 and 208 present the sequences of additional 
novel BPTI mutants with high affinity for hNE . We believe 
these mutants to have an affinity for hNE which is about an 
order of magnitude higher than that of BPTI (K15V, R17L) . 
All of these mutants contain, besides the active site 
mutations shown in the Tables, the MGNG mutation at 
positions 39-42. 

Although BPTI has been used in humans with very few 
adverse effects, a KuDom having much higher similarity to a 
human KuDom poses much less risk of causing an immune 
response. Thus, we transferred the active site changes 



found in EpiNE7 into the first KuDom of inter-a-trypsin 
inhibitor. For the purpose of this application, the 
numbering of the nucleic acid sequence for the ITI light 
chain gene is that of TRAB8 6 and that of the amino acid 
sequence is the one shown for UTI in Fig. 1 of GEBH86. The 
necessary coding sequence for ITI-DI is the 168 bases 
between positions 750 and 917 in the cDNA sequence presented 
in TRAB86. The amino acid sequence of human ITI-DI is 56 
amino acids long, extending from Lys-22 to Arg-77 of the 
complete ITI light chain sequence. The PI site of ITI-DI is 
Met-36. Tables 220-221 present certain ITI mutants; note 
that the residues are numbered according to the homologus 
Kunitz domain of BPTI, i.e., with the PI residue numbered 

15. It should be noted that it is probably acceptable to 
truncate the amino-terminal of ITI-DI, at least up to the 
first residue homologous with BPTI. 

The EpiNE7-inspired mutation (BPTI 15-19 region) of ITI- 
DI significantly enhanced its affinity for hNE . We also 
discovered that mutation of a different part of the molecule 
(BPTI 1-4 region) provided a similar increase in affinity. 
When these two mutational patterns were combined, a 
synergistic increase in affinity was observed. Further 
mutations in nearby amino acids (BPTI 26, 31, 34) led to 
additional improvements in affinity. 

The elastase-binding muteins of ITI-DI envisioned herein 
preferably differ from the wild-type domain at one or more 
of the following positions (numbered per BPTI) : 1, 2, 4, 15, 

16, 18, 19, 31 and 34. More preferably, they exhibit one or 
more of the following mutations: Lysl -> Arg; Glu2 -> Pro; 
Ser4 -> Phe*; Metl5 -> Val*, He; Glyl6 -> Ala; THrl8 -> 
Phe*; Serl9 -> Pro; Thr26 -> ALa; Glu31 -> Gin; Gln34 -> 
Val*. Introduction of one or more of the starred mutations 
is especially desirable, and, in one preferred embodiment, 
at least all of the starred mutations are present. 

In a second series of embodiments, the present invention 
relates to Kunitz-type domains which inhibit HNE, but 
excludes those domains corresponding exactly to the lead 
domains of claims 1 and 3 of PCT/US92 /01501 . Preferably, 
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such domains also differ from these lead domains by one or 
more mutations which are not class A substitutions, more 
preferably, not class A or B substitutions, and still more 
preferably, not class A, B or C substitutions, as defined in 
Table 65. Desirably, such domains are each more similar to 
one of the aforementioned reference proteins than to any of 
the lead proteins set forth in PCT/US92/01501 . 

The examples contain numerous examples of amino-acid 
sequences accompanied by DNA sequences that encode them. It 
is to be understood that the invention is not limited to the 
particular DNA sequence shown. 
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Example 1 : Expression and display of BPTI. ITI-D1, and other 
Kunitz Domains. 

Table 30 shows a display gene that encodes: 1) the Ml 3 III 
signal peptide, 2) BPTI, and 3) the first few amino-acids of 
5 mature M13 III protein. Phage have been made in which this 
gene is the only iii-like gene so that all copies of III 
expressed are expected to be modified at the amino terminus 
of the mature protein. Substitutions in the BPTI domain 
can be made in the cassettes delimited by the AccIII, XhoT, 

10 PflMI, Apal, BssHII, StuI, Xcal , Espl, SphI , or Narl (same 
recognition as KasI) sites. Table 100 gives amino-acid 
sequences of a number of Kunitz domains, some of which 
inhibit hNE. Each of the hNE-inhibiting sequences shown in 
Table 100 can be expressed as an intact hNE-binding protein 

15 or can be incorporated into a larger protein as a domain. 

Proteins that comprise a substantial part of one of the hNE- 
inhibiting sequences found in Table 100 are expected to 
exhibit hNE-inhibitory activity. This is particularly true 
if the sequence beginning with the first cysteine and 

20 continuing through the last cysteine is retained. 

ITI domain 1 is a Kunitz domain as discussed below. The 
ability of display phage to be retained on matrices that 
display hNE is related to the affinity of the particular 
Kunitz domain (or other protein) displayed on the phage. 

25 Expression of the ITI domain lzziii fusion gene and display 
of the fusion protein on the surface of phage were 
demonstrated by Western analysis and phage titer 
neutralization experiments. The infectivity of ITI-Dl- 
display phage was blocked by up to 99% by antibodies that 

30 bind ITI while wild-type phage were unaffected. 

Table 35 gives the sequence of a fusion gene comprising: 
a) the signal sequence of M13 III, b) ITI-D1, and c) the 
initial part of mature III of M13. The displayed ITI-D1 
domain can be altered by standard methods including: i) 

35 oligonucleotide-directed mutagenesis of single-stranded 

phage DNA, and ii) cassette mutagenesis of RF DNA using the 
restriction sites {BglZ, EagT, Ncol , Styl, PstI, and KasI 
(two sites)) designed into the gene. 
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Example 2: Fractionation of MA-ITI-D1 phage bound to 
agarose-immobilized protease beads. 

To test if phage displaying the ITI-D1::III fusion protein 
interact strongly with the proteases human neutrophil 
elastase (hNE) , aliquots of display phage were incubated 
with agarose-immobilized hNE beads ( "hNE beads") . The beads 
were washed and bound phage eluted by pH fractionation as 
described in US 5,223,409. The pHs used in the step 
gradient were 7.0, 6.0, 5.5, 5.0, 4.5, 4.0, 3.5, 3.0, 2.5, 
and 2.0. Following elution and neutralization, the various 
input, wash, and pH elution fractions were titered. Phage 
displaying ITI-D1 were compared to phage that display EpiNE- 
7 . 

The results of several fractionations are shown in Table 
212 (EpiNE-7 or MA-ITI-D1 phage bound to hNE beads) . The pH 
elution profiles obtained using the control display phage 
(EpiNE-7) were similar previous profiles (US 5,223,409). 
About 0.3% of the EpiNE-7 display phage applied to the hNE 
beads eluted during the fractionation procedure and the 
elution profile had a maximum for elution at about pH 4.0. 

The MA-ITT-Dl phage show no evidence of great affinity 
for hNE beads. The pH elution profiles for MA-ITI-D1 phage 
bound to hNE beads show essentially monotonic decreases in 
phage recovered with decreasing pH . Further, the total 
fractions of the phage applied to the beads that were 
recovered during the fractionation procedures were quite 
low: 0.002%. 

Published values of K ± for inhibition neutrophil elastase 
by the intact, large (M r =240,000) ITI protein range between 
60 and 150 nM (SWAI88, ODOM90) . Our own measurements of pH 
fraction of display phage bound to hNE beads show that phage 
displaying proteins with low affinity (>1 yiM) for hNE are 
not bound by the beads while phage displaying proteins with 
greater affinity (nM) bind to the beads and are eluted at 
about pH 5. If the first Kunitz-type domain of the ITI 
light chain is entirely responsible for the inhibitory 
activity of ITI against hNE, and if this domain is correctly 
displayed on the MA-ITI-D1 phage, then it appears that the 
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minimum affinity of an inhibitor for hNE that allows binding 
and fractionation of display phage on hNE beads is between 
50 and 100 nM . 

5 Example 3 : Alteration of the PI region of ITI-D1. 

We assume that ITI-D1 and EpiNE-7 have the same 3D 
configuration in solution as BPTI . Although EpiNE-7 and 
ITI-D1 are identical at positions 13, 17, 20, 32, and 39, 
they differ greatly in their affinities for hNE. To improve 

10 the affinity of ITI-D1 for hNE, the EpiNE-7 sequence Val 15 - 

Ala -| ? -Met 17 - Phe ie - Pro 1 c,-Arg 20 (bold, underscored amino acids are 
alterations) was incorporated into the ITI-Dl sequence by 
cassette mutagenesis between the Eagl and Styl/Ncol sites 
shown in Table 35. Phage isolates containing the ITI- 

15 D1::III fusion gene with the EpiNE-7 changes around the PI 
position are called MA-ITT-D1E7 . 

Example 4: Fractionation of MA-ITI-D1E7 phage. 

To test if ITI-DlE7-display phage bind hNE beads, pH elution 

20 profiles were measured. Aliquots of EpiNE-7, MA-ITI-D1, and 
MA-ITI-D1E7 display phage were incubated with hNE beads for 
three hours at room temperature (RT) . The beads were washed 
and phage were eluted as described in US 5,223,409, except 
that only three pH elutions were performed. These data are 

25 in Table 215. The pH elution profile of EpiNE-7 display 
phage is as described. MA-ITI-D1E7 phage show a broad 
elution maximum around pH 5. The total fraction of MA- IT I - 
D1E7 phage obtained on pH elution from hNE beads was about 
40-fold less than that obtained using EpiNE-7 display phage. 

30 The pH elution behavior of MA-ITI-D1E7 phage bound to hNE 

beads is qualitatively similar to that seen using 
BPTI [K15L] -III-MA phage. BPTI with the K15L mutation has an 
affinity for hNE of «3 nM. (Alterations and mutations are 
indicated by giving the original (wild-type) amino-acid 

35 type, then the position, and then the new amino-acid type; 
thus K15L means change Lys 15 to Leu.) Assuming all else 
remains the same, the pH elution profile for MA-ITI-D1E7 
suggests that the affinity of the free ITI-D1E7 domain for 



hNE might be in the nM range. If this is the case, the 
substitution of the EpiNE-7 sequence in place of the ITI-Dl 
sequence around the PI region has produced a 20- to 50-fold 
increase in affinity for hNE (assuming K 1 = 60 to 150 nM for 
the unaltered ITI-Dl) . 

If EpiNE-7 and ITI-D1E7 have the same solution structure, 
these proteins present the identical amino acid sequences to 
hNE over the interaction surface. Despite this similarity, 
EpiNE-7 exhibits a roughly 1000-fold greater affinity for 
hNE than does ITI-D1E7. This observation highlights the 
importance of non-contacting secondary residues in 
modulating interaction strengths. 

Native ITI light chain is glycosylated at two positions, 
Ser 10 and Asn 45 (GEBH86) . Removal of the glycosaminoglycan 
chains has been shown to decrease the affinity of the 
inhibitor for hNE about 5-fold (SELL87) . Another 
potentially important difference between EpiNE-7 and ITI- 
D1E7 is that of net charge. The changes in BPTI that 
produce EpiNE-7 reduce the total charge on the molecule from 
+6 to +1. Sequence differences between EpiNE-7 and ITI-D1E7 
further reduce the charge on the latter to -1. Furthermore, 
the change in net charge between these two molecules arises 
from sequence diff erences ■ occurring in the central portions 
of the molecules. Position 26 is Lys in EpiNE-7 and is Thr 
in ITI-D1E7, while at position 31 these residues are Gin and 
Glu, respectively. These changes in sequence not only alter 
the net charge on the molecules but also position a 
negatively charged residue close to the interaction surface 
in ITI-D1E7. It may be that the occurrence of a negative 
charge at position 31 (which is not found in any other of 
the hNE inhibitors described here) destabilized the 
inhibitor-protease interaction. 

Example 5 : Preparation of BITI-E7 Phage 

Possible reasons for MA-ITI-D1E7 phage having lower affinity 
for hNE than do MA-EpiNE7 phage include: a) incorrect 
cleavage of the Illsignal : : ITI-D1E7 : rmaturelll fusion 
protein, b) inappropriate negative charge on the ITI-D1E7 
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domain, c) conformational or dynamic changes in the Kunitz 
backbone caused by substitutions such as Phe 4 to Ser 4 , and d) 
non-optimal amino acids in the ITI-DlE7:hNE interface, such 
as Q 34 or A X1 . 

5 To investigate the first three possibilities, we 

substituted the first four amino acids of EpiNE7 for the 
first four amino acids of ITI-D1E7 . This substitution 
should provide a peptide that can be cleaved by signal 
peptidase-I in the same manner as is the 

10 Illsignal : :EpiNE7 : :matureIII fusion. Furthermore, Phe 4 of 
BPTI is part of the hydrophobic core of the protein; 
replacement with serine may alter the stability or dynamic 
character of ITI-D1E7 unfavorably. ITI-D1E7 has a 
negatively charged Glu at position 2 while EpiNE7 has Pro. 

15 We introduced the three changes at the amino terminus of the 
ITI-D1E7 protein (KIR, E2P, and S4F) by oligonucleotide- 
directed mutagenesis to produce BITI-E7; phage that display 
BITI-E7 are called MA-BITI-E7. 

We compared the properties of the ITI-III fusion proteins 

20 displayed by phage MA-ITI-D1 and MA-BITI using Western 

analysis as described previously and found no significant 
differences in apparent size or relative abundance of the 
fusion proteins produced by either display phage strain. 
Thus, there are no large differences in the processed forms 

25 of either fusion protein displayed on the phage. By 

extension, there are also no large differences in the 
processed forms of the gene III fusion proteins displayed by 
MA-ITI-D1E7 and MA-EpiNE7 . Large changes in protein 
conformation due to altered processing are therefore not 

30 likely to be responsible for the great differences in 

binding to hNE-beads shown by MA-ITI-D1E7 and MA-EpiNE7 
display phage. 

We characterized the binding properties to hNE-beads of 
MA-BITI and MA-BITI-E7 display phage using the extended pH 
35 fractionation procedure described in US 5,223,409. The 

results are in Table 216. The pH elution profiles for MA- 
BITI and MA-BITI-E7 show significant differences from the 
profiles exhibited by MA-ITI-D1 and MA-ITI-D1E7 . In both 



28 

cases, the alterations at the putative amino terminus of the 
displayed fusion protein produce a several-fold increase in 
the fraction of the input display phage eluted from the hNE- 
beads . 

5 The binding capacity of hNE-beads for display phage 

varies among preparations of beads and with age for each 
individual preparation of beads. Thus, it is difficult to 
directly compare absolute yields of phage from elutions 
performed at different times. For example, the fraction of 

10 MA-EpiNE7 display phage recovered from hNE-beads varies 

two-fold among the experiments shown in Tables 212, 215, and 
216. However, the shapes of the pH elution profiles are 
similar. It is possible to correct somewhat for variations 
in binding capacity of hNE-beads by normalizing display 

15 phage yields to the total yield of MA— EpiNE7 phage 

recovered from the beads in a concurrent elution. When the 
data shown in Tables 212, 215, and 216 are so normalized, 
the recoveries of display phage, relative to recovered MA- 
EpiNE7, are shown in Table 10. 



Table 10: Recovery of Display phage 


Display Phage strain 


Normalized 
fraction of input 


MA-ITI-D1 


0.0067 


MA-BITI 


0 . 018 


MA-ITI-D1E7 


0.027 


MA-BITI-E7 


0.13 



Thus, the changes in the amino terminal sequence of the 
displayed protein produce a three- to five-fold increase in 
the fraction of display phage eluted from hNE-beads. 

30 In addition to increased binding, the changes introduced 

into MA-BITI-E7 produce phage that elute from hNE-beads at a 
lower pH than do the parental MA-ITI-D1E7 phage. While the 
parental display phage elute with a broad pH maximum 
centered around pH 5.0, the pH elution profile for MA-BITI- 

35 E7 display phage has a pH maximum at around pH 4.75 to pH 
4.5. 



The pH elution maximum of the MA-BITI-E7 display phage is 
between the maxima exhibited by the BPTI (K15L) and 
BPTI(K15V, R17L) display phage ( pH 4.75 and pH 4.5 to pH 
4.0, respectively) described in US 5,223,409. From the pH 
maximum exhibited by the display phage we predict that the 
BITI-E7 protein free in solution may have an affinity for 
hNE in the 100 pM range. This would represent an 
approximately ten-fold increase in affinity for hNE over 
that estimated above for ITI-D1E7. 

As was described above, Western analysis of phage 
proteins show that there are no large changes in gene III 
fusion proteins upon alteration of the amino terminal 
sequence. Thus, it is unlikely that the changes in affinity 
of display phage for hNE-beads can be attributed to large- 
scale alterations in protein folding resulting from altered 
("correct") processing of the fusion protein in the amino 
terminal mutants. The improvements in binding may in part 
be due to: 1) the decrease in the net negative charge (-1 to 
0) on the protein arising from the Glu to Pro change at 
position 2, or 2) increased protein stability resulting from 
the Ser to Phe substitution at residue 4 in the hydrophobic 
core of the protein, or 3) the combined effects of both 
substitutions . 



Example 6 : Production and properties of MA-BITI-E7-1222 and 
MA-BITI-E7-141 

Within the presumed Kunitz:hNE interface, BITI-E7 and EpiNE7 
differ at only two positions: 11 and 34. In EpiNE7 these 
residues are Thr and Val, respectively. In BITI-E7 they are 
Ala and Gin. In addition BITI-E7 has Glu at 31 while EpiNE7 
has Gin. This negative charge may influence binding 
although the residue is not directly in the interface. We 
used oligonucleotide-directed mutagenesis to investigate the 
effects of substitutions at positions 11, 31 and 34 on the 
protease : inhibitor interaction. 

ITI-D1 derivative BITI-E7-1222 is BITI-E7 with the 
alteration A11T. ITI-D1 derivative BITI-E7-141 is BITI-E7 
with the alterations E31Q and Q34V; phage that dhe presence 



of tisplay these proteins are MA-BITT-E7-1222 and MA-BITI- 
E7-141. We determined the binding properties to hNE-beads 
of MA-BITI-E7-1222 and MA-BITI-E7-141 display phage using 
the extended pH fractionation protocol described previously. 
The results are in Tables 217 (for MA-BITI-E7 and MA-BITI- 
E7-1222) and 218 (for MA-EpiNE7 and MA-BITI-E7 -14 1 ) . The 
pH elution profiles for the MA-BITI-E7 and MA-BITI-E7-1222 
phage are almost identical. Both phage strains exhibit pH 
elution profiles with identical maxima (between pH 5.0 and 
pH 4.5) as well as the same total fraction of input phage 
eluted from the hNE-beads (0.03%) . Thus, the T11A 
substitution in the displayed ITI-D1 derivative has no 
appreciable effect on the binding to hNE-beads. 

In contrast, the changes at positions 31 and 34 strongly 
affect the hNE-binding properties of the display phage. The 
elution profile pH maximum of MA-BITI-E7-14 1 phage is 
shifted to lower pH relative to the parental MA-BITI-E7 
phage. Further, the position of the maximum (between pH 4.5 
and pH 4.0) is identical to that exhibited by MA-EpiNE7 
phage in this experiment. Finally, the MA-BITI-E7-141 phage 
show a ten-fold increase, relative to the parental MA-BITI- 
E7, in the total fraction of input phage eluted from hNE- 
beads (0.3% vs 0.03%). The total fraction of MA-BITI-E7-1 4 1 
phage eluted from the hNE-beads is nearly twice that of MA- 
EpiNE7 phage. 

The above results show that binding by MA-BITI-E7-141 
display phage to hNE-beads is comparable to that of MA- 
EpiNE7 phage. If the two proteins (EpiNE7 and BITI-E7-141) 
in solution have similar affinities for hNE, then the 
affinity of the BITI-E7-141 protein for hNE is on the order 
of 1 pM. Such an affinity is approximately 100-fold greater 
than that estimated above for the parental protein (BITI-E7) 
and is 10 5 to 10 s times as great as the affinity for hNE 
reported for the intact ITI protein. 

Example 7: Mutagenesis of BITI-E7-141 

BITI-E7-141 differs from ITI-D1 at nine positions (1, 2, 4, 
15, 16, 18, 19, 31, and 34) . To obtain the protein having 
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the fewest changes from ITI-D1 while retaining high specific 
affinity for hNE, we have investigated the effects of 
reversing the changes at positions 1, 2, 4, 16, 19, 31, and 
34. The derivatives of BITI-E7-141 that were tested are 
5 MUT1619, MUTP1, and MUTT26A. The derivatives of BITI that 

were tested are AMIN01 and AMIN02 . The derivative of BITI-E7 
that was tested is MUTQE . All of these sequences are shown 
in Table 100. MUT1619 restores the ITI-D1 residues Ala 16 and 
Ser 19 . The sequence designated "MUTP1" asserts the amino 
10 acids I 15 , G 16 , S 19 in the context of BITI-E7-141. It is 

likely that M 17 and F 18 are optimal for high affinity hNE 
N binding. G 16 and S 19 occurred frequently in the high affinity 

hNE-binding BPTI-variants obtained from fractionation of a 
M library of BPTI-variants against hNE (ROBE92). Three 

*M 15 changes at the putative amino terminus of the displayed ITI- 

f|j D1 domain were introduced to produce the MA-BITI series of 

rii phage. AMINOl carries the sequence K x - E 2 while AMIN02 

g carries K^-S^ Other amino acids in the amino-terminal 

N region of these sequences are as in ITI-D1 . MUTQE is 

20 derived from BITI-E7-141 by the alteration Q31E (reasseting 
O the ITI-Dl w.t. residue) . Finally, the mutagenic 

nl oligonucleotide MUTT26A is intended to remove a potential 

site of N-linked glycosylation, N 24 -G 25 -T 26 . In the intact ITI 
molecule isolated from human serum, the light chain 
25 polypeptide is glycosylated at this site (N 45 , ODOM90) . It 
is likely that N 24 will be glycosylated if the BITI-E7-141 
protein is produced via eukaryotic expression. Such 
glycosylation may render the protein immunogenic when used 
for long-term treatment. The MUTT26A contains the 
30 alteration T26A and removes the potential glycosylation site 
with minimal changes in the overall chemical properties of 
the residue at that position. In addition, an Ala residue is 
frequently found in other BPTI homologues at position 26 
(see Table 34 of US 5,223,409). Mutagenesis was performed 
35 on ssDNA of MA-BITI-E7-141 phage. 



Example 8 : hNE-bin ding properties of mutaqenized MA-BITI-E7 
141 display phage 
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Table 219 shows pH elution data for various display phage 
eluted from hNE-beads . Total pfu applied to the beads are 
in column two. The fractions of this input pfu recovered in 
each pH fraction of the abbreviated pH elution protocol (pH 
5 7.0, pH 3.5, and pH 2.0) are in the next three columns. For 
data obtained using the extended pH elution protocol, the pH 
3.5 listing represents the sum of the fractions of input 
recovered in the pH 6.0, pH 5.5, pH 5.0, pH 4.5, pH 4.0, and 
pH 3.5 elution samples. The pH 2.0 listing is the sum of 

10 the fractions of input obtained from the pH 3.0, pH 2.5, and 
pH 2.0 elution samples. The total fraction of input pfu 
obtained throughout the pH elution protocol is in the sixth 
column. The final column of the table lists the total 
fraction of input pfu recovered, normalized to the value 

15 obtained for MA-BITI-E7-141 phage. 

Two factors must be considered when making comparisons 
among the data shown in Table 219. The first is that due to 
the kinetic nature of phage release from hNE-beads and the 
longer time involved in the extended pH elution protocol, 

20 the fraction of input pfu recovered in the pH 3.5 fraction 
will be enriched at the expense of the pH 2 . 0 fraction in 
the extended protocol relative to those values obtained in 
the abbreviated protocol. The magnitude of this effect can 
be seen by comparing the results obtained when MA-BITI-E7- 

25 141 display phage were eluted from hNE-beads using the two 

protocols. The second factor is that, for the range of 
input pfu listed in Table 219, the input pfu influences 
recovery. The greater the input pfu, the greater the total 
fraction of the input recovered in the elution. This effect 

30 is apparent when input pfu differ by more than a factor of 

about 3 to 4 . The effect can lead to an overestimate of 
affinity of display phage for hNE-beads when data from phage 
applied at higher titers is compared with that from phage 
applied at lower titers . 

35 With these caveats in mind, we can interpret the data in 

Table 219. The effects of the mutations introduced into MA- 
BITI-E7-141 display phage ("parental") on binding of display 
phage to hNE-beads can be grouped into three categories: 
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those changes that have little or no measurable effects, 
those that have moderate (2- to 3-fold) effects, and those 
that have large (>5-fold) effects. 

The MUTT26A and MUTQE changes appear to have little 
5 effect on the binding of display phage to hNE-beads . In 

terms of total pfu recovered, the display phage containing 
these alterations bind as well as the parental to hNE-beads. 
Indeed, the pH elution profiles obtained for the parental 
and the MUTT26A display phage from the extended pH elution 

10 protocol are indistinguishable. The binding of the MUTTQE 

display phage appears to be slightly reduced relative to the 
parental and, in light of the applied pfu, it is likely that 
this binding is somewhat overestimated. 

The sequence alterations introduced via the MUTP1 and 

15 MUT1619 oligonucleotides appear to reduce display phage 

binding to hNE-beads about 2- to 3-fold. In light of the 
input titers and the distributions of pfu recovered among 
the various elution fractions, it is likely that 1) both of 
these display phage have lower affinities for hNE-beads than 

20 do MA-EpiNE7 display phage, and 2) the MUT1619 display phage 
have a greater affinity for hNE-beads than do the MUTP1 
display phage. 

The sequence alterations at the amino terminus of BITI- 
E7-14 appear to reduce binding by the display phage to hNE- 
25 beads at least ten fold. The AMIN02 changes are likely to 
reduce display phage binding to a substantially greater 
extent than do the AMIN01 changes. 

On the basis of the above interpretations of the data in 
Table 219, we can conclude that: 
30 1-) The substitution of ALA for THR at position 26 in 

ITI-Dl and its derivatives has no effect on the 
interaction of the inhibitor with hNE . Thus, the 
possibility of glycosylation at Asn 24 of an inhibitor 
protein produced in eukaryotic cell culture can be 
35 avoided with no reduction in affinity for hNE. 

2.) The increase in affinity of display phage for hNE- 
beads from the changes E31Q and Q34V results 
primarily from the Val substitution at 34. 
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3. ) All three changes at the amino terminal region of 

ITI-D1 (positions 1,2, and 4) influence display 
phage binding to hNE-beads to varying extents. The 
S4F alteration seems to have about the same effect 
as does E2P. The change at position 1 appears to 
have only a small effect. 

4. ) The changes in the region around the PI residue in 

BITI-E7-141 (position 15) influence display phage 
binding to hNE . The changes A16G and P19S appear to 
reduce the affinity of the inhibitor somewhat 
(perhaps 3-fold) . The substitution of I15V further 
reduces binding. 
BITI-E7-141 differs from ITI-D1 at nine positions. From 
the discussion above, it appears likely that a high affinity 
hNE-inhibitor based on ITI-Dl could be constructed that 
would differ from the ITI-Dl seguence at only five or six 
positions. These differences would be: Pro at position 2, 
Phe at position 4, Val at position 15, Phe at position 18, 
Val at position 34, and Ala at position 26. If 

glycosylation of Asn 24 is not a concern Thr could be retained 
at 26. 



Summary: estimat ed affinities of isolated ITI-Dl derivatives 
for hNE 

25 On the basis of display phage binding to and elution from 

hNE beads, it is possible to estimate affinities for hNE 
that various derivatives of ITI-Dl may display free in 
solution. These estimates are summarized in Table 55. 

3 0 hNE Inhibitors Derived from ITI Domain 2 



In addition to hNE inhibitors derived from ITI-Dl, the 
present invention comprises hNE inhibitors derived from ITI- 
D2. These inhibitors have been produced in Pichia pastoris 
35 in good yield. EPI-HNE-4 inhibits human neutrophil elastase 
with a K D « 5 pM . 



PURIFICATION AND PROPERTIES OF EPI-HNE PROTEINS 



I. EPI-HNE Proteins. 

Example 9: Amino-acid sequences of EPI-HNE-3 and EPI-HNE-4 
Table 100 gives amino acid sequences of four human- 
neutrophil-elastase (hNE) inhibitor proteins: EPI-HNE-1 
(identical to EpiNEl), EPI-HNE-2, EPI-HNE-3, and EPI-HNE-4. 
These proteins have been derived from the parental Kunitz- 
type domains shown. Each of the proteins is shown aligned 
to the parental domain using the six cysteine residues 
(shaded) characteristic of the Kunitz-type domain. Residues 
within the inhibitor proteins that differ from those in the 
parental protein are in upper case. Entire proteins having 
the sequences EPI-HNE-1, EPI-HNE-2, EPI-HNE-3, and EPI-HNE-4 
(Table 100) have been produced. Larger proteins that 
comprise one of the hNE-inhibiting sequences are expected to 
have potent hNE-inhibitory activity; EPI-HNE-1, EPI-HNE-2, 
EPI-HNE-3, and EPI-HNE-4 are particularly preferred. It is 
expected that proteins that comprise a significant part of 
one of the hNE-inhibiting sequences found in Table 100 
(particularly if the sequence starting at or before the 
first cysteine and continuing through or beyond the last 
cysteine is retained) will exhibit potent hNE-inhibitory 
activity . 

The hNE-inhibitors EPI-HNE-1 and EPI-HNE-2 are derived 
from the bovine protein BPTI (aprotinin) . Within the 
Kunitz-type domain, these two inhibitors differ from BPTI at 
the same eight positions: K15I, R17F, I18F, I19P, R39M, 
A40G, K41N, and R42G. In addition, EPI-HNE-2 differs from 
both BPTI and EPI-HNE-1 in the presence of four additional 
residues (EAEA) present at the amino terminus. These 
residues were added to facilitate secretion of the protein 
in Pichia pastoris . 

EPI-HNE-3 is derived from the second Kunitz domain of the 
light chain of the human inter-Of-trypsin inhibitor protein 
(ITI-D2) . The amino acid sequence of EPI-HNE-3 differs from 
that of ITI-D2 (3-58) at only four positions: R15I, I18F, 
Q19P and L20R. EPI-HNE-4 differs from EPI-HNE-3 by the 
substitution A3E (the amino-terminal residue) which both 
facilitates secretion of the protein in P. pastoris and 
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improves the K D for hNE . Table 602 gives some physical 
properties of the hNE inhibitor proteins. All four proteins 
are small, high-affinity (K 1 =2 to 6 pM) , fast-acting (k on =4 
to 11 xlO 6 M^s -1 ) inhibitors of hNE. 

5 

II. Production of the hNE -inhibitors EPI-HNE-2, EPI-HNE- 

3, and EPI-HNE-4. 

Example 10: Pichla pastor is production system. 

10 Transformed strains of Pichia pastoris were used to express 
the various EPI-HNE proteins derived from BPTI and ITI-D2. 
Protein expression cassettes are cloned into the plasmid 
pHIL-D2 using the Bs tBI and EcoRI sites (Table 111) . The 
DNA sequence of pHIL-D2 is given in Table 250. The cloned 

15 gene is under transcriptional control of P. pastoris 
upstream (labeled "aoxl 5'") aoxl gene promoter and 
regulatory sequences (dark shaded region) and downstream 
polyadenylation and transcription termination sequences 
(second cross-hatched region, labeled "aoxl 3'"). P. 

20 pastoris GS115 is a mutant strain containing a non- 
functional histidinol dehydrogenase {his4) gene. The his4 
gene contained on plasmid pHIL-D2 and its derivatives can be 
used to complement the histidine deficiency in the host 
strain. Linearization of plasmid pHIL-D2 at the indicated 

25 SacI site directs plasmid incorporation into the host genome 

at the aoxl locus by homologous recombination during 
transformation. Strains of P. pastoris containing 
integrated copies of the expression plasmid will express 
protein genes under control of the aoxl promoter when the 

30 promoter is activated by growth in the presence of methanol 
as the sole carbon source. 

We have used this high density Pichia pastoris production 
system to produce proteins by secretion into the cell CM. 
Expression plasmids were constructed by ligating synthetic 

35 DNA sequences encoding the S. cerevisiae mating factor a 

prepro peptide fused directly to the amino terminus of the 
desired hNE inhibitor into the plasmid pHIL-D2 using the 
Bs tBI and the EcoRI sites shown. Table 251 gives the DNA 
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sequence of a BstBI-to-EcoRI insert that converts pHIL-D2 
into pHIL-D2 (MFa-PrePro: :EPI-HNE-3) . In this construction, 
the fusion protein is placed under control of the upstream 
inducible P. pastoris aoxl gene promoter and the downstream 
5 aoxl gene transcription termination and polyadenylat ion 

sequences. Expression plasmids were linearized by Sad 
digestion and the linear DNA was incorporated by homologous 
recombination into the genome of the P. pastoris strain 
GS115 by spheroplast transformation. Regenerated 

10 spheroplasts were selected for growth in the absence of 

added histidine, replated, and individual isolates were 
screened for methanol utilization phenotype {mut + ) , secretion 
levels, and gene dose (estimated via Southern hybridization 
experiments) . High level secretion stains were selected for 

15 production of hNE inhibitors: PEY-33 for production of EPI- 
HNE-2 and PEY-43 for production of EPI-HNE-3 . In both of 
these strains, we estimate that four copies of the 
expression plasmid are integrated as a tandem array into the 
aoxl gene locus. 

20 To facilitate alteration of the Kunitz-domain encoding 

segment of pHIL-D2 derived plasmids, we removed two 
restriction sites given in Table 111: the BstBl at 4780 and 
the Aatll site at 54 98. Thus, the Kunitz-domain encoding 
segment is bounded by unique Aatll and EcoRI sites. The new 

25 plasmids are called pD2pick ( "insert " ) where "insert" defines 
the domain encoded under control of the aoxl promoter. 
Table 253 gives the DNA sequence of pD2pick (MFa :: EPI-HNE-3 ) . 
Table 254 gives a list of restriction sites in 
pD2pick(MFa: :EPI-HNE-3) . 

30 EPI-HNE-4 is encoded by pD2pick (MFaPrePro : : EPI-HNE-4 ) 

which differs from pHIL-D2 in that: 1) the Aatll /EcoRI 
segment of the sequence given in Table 251 is replaced by 
the segment shown in Table 2 52 and 2) the changes in the 
restriction sites discussed above have been made. Strain 

35 PEY-53 is P. pastoris GS115 transformed with 
pD2pick(MFa: :EPI-HNE-4) . 
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Example 11: Protein Production 

To produce the proteins, P. pastoris strains were grown in 
mixed-feed fermentations similar to the procedure described 
by Digan et al. (DIGA89) . Although others have reported 
5 production of Kunitz domains in P. pastoris , it is well 

known that many secretion systems involve proteases. Thus, 
it is not automatic that an altered Kunitz domain having a 
high potency in inhibiting hNE could be secreted from P. 
pastoris because the new inhibitor might inhibit some key 
10 enzyme in the secretion pathway. Nevertheless, we have 

found that P. pastoris can secrete hNE inhibitors in high 
yield . 

Briefly, cultures were first grown in batch mode with 
glycerol as the carbon source. Following exhaustion of 

15 glycerol, the culture was grown for about four hours in 

glycerol-limited feed mode to further increase cell mass and 
to derepress the aoxl promoter. In the final production 
phase, the culture was grown in methanol-limited feed mode. 
During this phase, the aoxl promoter is fully active and 

20 protein is secreted into the CM. 

Table 607 and Table 608 give the kinetics of cell growth 
(estimated as A 600 ) and protein secretion (mg/1) for cultures 
of PEY-33 and PEY-43 during the methanol-limited feed 
portions of the relevant fermentations . Concentrations of 

25 the inhibitor proteins in the fermentation cultures were 

determined from in vitro assays of hNE inhibition by diluted 
aliguots of cell-free culture media obtained at the times 
indicated. Despite similarities in gene dose, fermentation 
conditions, cell densities, and secretion kinetics, the 

30 final concentrations of inhibitor proteins secreted by the 
two strains differ by nearly an order of magnitude. The 
final concentration of EPI-HNE-2 in the PEY-33 fermentation 
CM was 720 mg/1. The final concentration of EPI-HNE-3 in 
the PEY-43 fermentation CM was 85 mg/1. The differences in 

35 final secreted protein concentrations may result from 

idiosyncratic differences in the efficiencies with which the 
yeast synthesis and processing systems interact with the 
different protein sequences. 
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Strain PEY-33 secreted EPI-HNE-2 protein into the CM as a 
single molecular species which amino acid composition and N- 
terminal sequencing reveled to be the correctly-processed 
Kunitz domain with the sequence shown in Table 601. The 
5 major molecular species produced by PEY-43 cultures was the 
properly-processed EPI-HNE-3 protein. However, this strain 
also secreted a small amount (about 15% to 20% of the total 
EPI-HNE-3) of incorrectly-processed material. This material 
proved to be a mixture of proteins with amino terminal 
10 extensions (primarily nine or seven residues in length) 

arising from incorrect cleavage of the MF a PrePro leader 
peptide from the mature Kunitz domain. The correctly 
processed protein was purified substantially free of these 
contaminants as described below. 

15 

III. Purification of the hNE-inhibitors EPI-HNE-2 and 

EPI-HNE-3. 



The proteins can be readily purified from fermenter CM by 
20 standard biochemical techniques. The specific purification 

procedure varies with the specific properties of each 
protein as described below. 



Example 12 : Purification of EPI-HNE-2. 

25 Table 603 gives particulars of the purification of EPI-HNE- 
2, lot 1. The PEY-33 fermenter culture was harvested by 
centrifugation at 8000 x g for 15 min and the cell pellet 
was discarded. The 3.3 liter supernatant fraction was 
microf iltered used a Minitan Ultrafiltration System 

30 (Millipore Corporation, Bedford, MA) equipped with four 0.2)1 

filter packets. 

The filtrate obtained from the microf iltration step was 
used in two subsequent ultrafiltration steps. First, two 
30K ultrafiltrations were performed on the 0.2y 

35 microf iltrate using the Minitan apparatus equipped with 
eight 30,000 NMWL polysulfone filter plates (#PLTK0MP04, 
Millipore Corporation, Bedford, MA) . The retentate solution 
from the first 30K ultrafiltration was diluted with 10 mM 



NaCitrate, pH=3.5, and subjected to a second 30K 
ultrafiltration. The two 30K ultraf iltrates were combined 
to give a final volume of 5 liters containing about 1.4 gram 
of EPI-HNE-2 protein (estimated from hNE-inhibition 
measurements) . 

The 30K ultraf iltrate was concentrated with change of 
buffer in the second ultrafiltration step using the Minitan 
apparatus equipped with eight 5,000 NMWL filter plates 
(#PLCC0MP04, Millipore Corporation, Bedford, MA) . At two 
times, during the 5K ultrafiltration, the retentate solution 
was diluted from about 300 ml to 1.5 liters with 10 mM 
NaCitrate, pH=3 . 5 . The final 5K ultrafiltration retentate 
(Ca. 200 ml) was diluted to a final volume of 1000 ml with 
10 mM NaCitrate, pH-3.5. 

EPI-HNE-2 protein was obtained from the 5K 
ultrafiltration retentate solution by ammonium sulfate 
precipitation at RT. 100 ml of 0.25 M ammonium acetate, 
pH=3.2, (1/10 volume) was added to the 5K ultrafiltration 
retentate, followed by one final volume (1.1 liter) of 3 M 
ammonium sulfate. Following a 30 minute incubation at RT, 
precipitated material was pelleted by centrif ugation at 
10,000 x g for 45 minutes. The supernatant solution was 
removed, the pellet was dissolved in water in a final volume 
of 400 ml, and the ammonium sulfate precipitation was 
repeated using the ratios described above. The pellet from 
the second ammonium sulfate precipitation was dissolved in 
50 mM sodium acetate, pH=3 . 5 at a final volume of 300 ml. 

Residual ammonium sulfate was removed from the EPI-HNE-2 
preparation by ion exchange chromatography. The solution 
from the ammonium sulfate precipitation step was applied to 
a strong cation-exchange column (50 ml bed volume Macroprep 
50S) (Bio-Rad Laboratories, Inc, Hercules, CA) previously 
equilibrated with 10 mM NaCitrate, pH=3 . 5 . After loading, 
the column was washed with 300 ml of 10 mM NaCitrate, 
pH=3.5. EPI-HNE-2 was then batch-eluted from the column 
with 300 ml of 50 mM NH 4 OAc, pH=6.2. Fractions containing 
EPI-HNE-2 activity were pooled and the resulting solution 
was lyophilized. The dried protein powder was dissolved in 



50 ml dH 2 0 and the solution was passed through a 0.2]a filter 
(#4192, Gelman Sciences, Ann Arbor, MI). The concentration 
of the active inhibitor in the final stock solution was 
determined to be 2 mM (13.5 mg/ml). This stock solution 
(EPI-HNE-2, Lot 1) has been stored as 1 ml aliquots at 4°C 
and -70°C for more than 11 months with no loss in activity.' 

Table 603 summarizes the yields and relative purity of 
EPI-HNE-2 at various steps in the purification procedure. 
The overall yield of the purification procedure was about 
30%. The major source of loss was retention of material in 
the retentate fractions of the 0.2p microf iltration and 30k 
ultrafiltration steps. 

Example 13: Purification of EPI-HNE-3 . 

Purification of EPI-HNE-3, lot 1, is set out in Table 604. 
The PEY-43 fermenter culture was harvested by centrif ugation 
at 8,000 x g for 15 min and the cell pellet was discarded. 
The supernatant solution (3100 ml) was microf iltered through 
0.2p. Minitan packets (4 packets). After the concentration, 
a diaf iltration of the retentate was performed so that the 
final filtrate volume from the 0.2u filtration was 3300 ml. 

A 30K ultrafiltration was performed on the filtrate from 
the 0.2u microf iltration step. When the retentate volume 
had been reduced to 250 ml, a diaf iltration of the retentate 
was performed at a constant retentate volume (250 ml) for 30 
min at a rate of 10 ml/min. The resulting final volume of 
filtrate was 3260 ml. 

EPI-HNE-3 protein and other medium components were found 
to precipitate from solution when the fermenter CM was 
concentrated. For this reason, the 5k ultrafiltration step 
was not performed. 

Properly processed EPI-HNE-3 was purified substantially 
free of mis-processed forms and other fermenter culture 
components by ion exchange chromatography. A 30 ml bed 
volume strong cation ion exchange column (Macroprep 50S) was 
equilibrated with 10 mM NaCitrate pH=3 . 5 . The 30K 
ultrafiltration filtrate was applied to the column and 
binding of EPI-HNE-3 to the column was confirmed by 



demonstrating the complete loss of inhibitor activity in the 

column flow through. The column was then washed with 300 ml 

of 10 mM NaCitrate, pH=3 . 5 . 

To remove EPI-HNE-3 from the column, we sequentially 

eluted it with 300 ml volumes of the following solutions: 
100 mM ammonium acetate, pH=3 . 5 
50 mM ammonium acetate, pH=4.8 
50 mM ammonium acetate, pH=6.0 
50 mM ammonium acetate, pH=6.0, 0.1 M NaCl 
50 mM ammonium acetate, pH=6.0, 0.2 M NaCl 
50 mM ammonium acetate, pH=6.0, 0.3 M NaCl 
50 mM ammonium acetate, pH=6.0, 0.4 M NaCl 
50 mN Tris/Cl pH=8.0, 1.0 NaCl 

The majority of the EPI-HNE-3 eluted in two 50 mM ammonium 
acetate, pH=6.0 fractions. The 0.1 M NaCl fraction 
contained about 19% of the input EPI-HNE-3 activity (28 mg 
of 159 mg input) and essentially all of the mis-processed 
forms of EPI-HNE-3. The 0 . 2M NaCl fraction contained about 
72% (114 mg) of the input EPI-HNE-3 and was almost 
completely free of the higher molecular weight mis-processed 
forms and other UV-absorbing contaminants. The fractions 
from the 50 mM ammonium acetate, pH=6.0, 0.2 M NaCl elution 
having the highest concentrations of EPI-HNE-3 were combined 
(95 mg) . 

An ammonium sulfate precipitation was performed on the 
0.2 M NaCl, pH=6.0 ion exchange column eluate. 800 ml of 3 
M ammonium sulfate was added to 160 ml of eluate solution 
(final ammonium sulfate concentration = 2.5 M) and the 
mixture was incubated at RT for 18 hours. The precipitated 
material was then pelleted by centrif ugation at 10,000 x g 
for 45 minutes. The supernatant fluid was discarded and the 
pelleted material was dissolved in 100 ml of water. 

Residual ammonium sulfate was removed from the EPI-HNE-3 
preparation by batch ion exchange chromatography. The pH of 
the protein solution was adjusted to 3 . 0 with diluted (1/10) 
HOAc and the solution was then applied to a 10 ml bed volume 
Macroprep 50S column that had been equilibrated with 10 mM 
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NaCitrate, pH=3 . 5 . Following sample loading, the column was 
washed with 100 ml of 10 mM NaCitrate, pH=3 . 5 followed by 
100 ml of dH 2 0. EPI-HNE-3 was eluted from the column with 
100 ml of 50 mM NH 4 C0 3 , pH=9.0. pH9 fractions having the 
5 highest concentrations of EPI-HNE-3 were combined (60 mg) 
and stored at 4°C for 7 days before lyophili za t ion . 

The lyophilized protein powder was dissolved in 26 ml 
dH 2 0 and the solution was passed through a 0.2u filter 
(#4912, Gelman Sciences, Ann Arbor, MI) . The concentration 

10 of active inhibitor in the final stock solution was found to 

be 250 uM (1.5 mg/ml) . This stock solution (EPI-HNE-3, Lot 
1) has been stored as 1 ml aliquots at -70°C for more than 
six months with no loss of activity. EPI-HNE-3 stored in 
water solution (without any buffering) deteriorated when 

15 kept at 4°C. After five months, about 70% of the material 
was active with a K A of about 12 pM . 

Table 604 gives the yield and relative purity of EPI-HNE- 
3 at various steps in the purification procedure. A major 
purification step occurred at the first ion exchange 

20 chromatography procedure. The ammonium sulfate 

precipitation step provided only a small degree of further 
purification. Some loss of inhibitor activity occurred on 
incubation at pH=9 (See pH stability data) . The production 
and purification of EPI-HNE-1 and EPI-HNE-4 were analogous 

25 to that of EPI-HNE-2 and EPI-HNE-3. 

Example 14: Tricine-PAGE Analysis of EPI-HNE-2 and EPI-HNE- 
3 . 

The high resolution tricine gel system of Schagger and von 
30 Jagow (SCHA87) was used to analyze the purified proteins 
produced {vide supra) . For good resolution of the low 
molecular weight EPI-HNE proteins we used a 16.5% resolving 
layer with 10% separating and 4% stacking layers. Following 
silver staining, we inspected a gel having: 
35 Lane 1: EPI-HNE-2 25 ng, 

Lane 2: EPI-HNE-2 50 ng, 

Lane 3: EPI-HNE-2 100 ng, 

Lane 4: EPI-HNE-2 200 ng, 
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Lane 5: EPI-HNE-3 25 ng, 
Lane 6: EPI-HNE-3 50 ng, 
Lane 7: EPI-HNE-3 100 ng, 
Lane 8: EPI-HNE-3 200 ng, and 

Lane 9: Molecular-weight standards: RPN 755, (Amersham 

Corporation, Arlington Heights, IL) . 
Stained proteins visible on the gel and their molecular 
weights in Daltons are: ovalbumin (46,000), carbonic 
anhydrase (30,000), trypsin inhibitor (21,500), lysozyme 
(14,300), and aprotinin (6,500). The amount of protein 
loaded was determined from measurements of hNE-inhibi tion . 
We found the following features. EPI-HNE-2, Lot 1 shows a 
single staining band of the anticipated size (ca. 6,700 D) 
at all loadings. Similarly, EPI-HNE-3, Lot 1 protein shows 
a single staining band of the anticipated size (ca. 6,200 
D) . At the highest loading, traces of the higher molecular 
weight (ca. 7,100 D) incorrectly processed form can be 
detected. On the basis of silver-stained high-resolution 
PAGE analysis, the purity of both protein preparations is 
assessed to be significantly greater than 95%. 

IV. Properties of EPI-HNE-2 and. EPI-HNE-3. 

A. Inhibition of hNE . 

Example 15: Measured K ^ s of EPI-HNE proteins for hNE 
Inhibition constants for the proteins reacting with hNE (K ± ) 
were determined using RT measurements of hydrolysis of a 
fluorogenic substrate (N-methoxysuccinyl-Ala-Ala-Pro-Val-7- 
araino-4-methylcoumarin, Sigma M-9771) by hNE. For these 
measurements, aliquots of the appropriately diluted 
inhibitor stocks were added to 2 ml solutions of 0.847 nM 
hNE in reaction buffer (50 mM Tris-Cl, pH=8.0, 150 mM NaCl, 
1 mM CaCl 2 , 0.25% Triton-X-100 ) in plastic fluorescence 
cuvettes. The reactions were incubated at RT for 30 
minutes. At the end of the incubation period, the 
fluorogenic substrate was added at a concentration of 25 uM 
and the time course for increase in fluorescence at 470 nm 
(excitation at 380 nm) due to enzymatic substrate cleavage 
was recorded using a spectrof luorimeter ( Perkin-Elmer 650- 



15) and strip chart recorder. We did not correct for 
competition between substrate and inhibitor because (S 0 /K m ) 
is 0.07 (where S 0 is the substrate concentration and K m is 
the binding constant of the substrate for hNE) . K i is 
related to K apparent by K ± = K apparent x (1/ (1 + (S 0 /KJ ) ) . The 
correction is small compared to the possible errors in 
Apparent- Rates of enzymatic substrate cleavage were 
determined from the linear slopes of the recorded increases 
in fluorescence. The percent residual activity of hNE in 
the presence of the inhibitor was calculated as the 
percentage of the rate of fluorescence increase observed in 
the presence of the inhibitor to that observed when no added 
inhibitor was present. 

We recorded data used to determine K ± for EPI-HNE-2 and 
EPI-HNE-3 reacting with hNE. Data obtained as described 
above are recorded as percent residual activity plotted as a 
function of added inhibitor. Values for K ± and for active 
inhibitor concentration in the stock are obtained from a 
least-squares fit program. From the data, K ± values for EPI- 
HNE-2 and for EPI-HNE-3 reacting with hNE at RT were 
calculated to be 4.8 pM and 6.2 pM, respectively. 
Determinations of K i for EPI-HNE-2 and EPI-HNE-3 reacting 
with hNE are given in Table 610 and Table 611. 

The kinetic on-rates for the inhibitors reacting with hNE 
(k on ) were determined from measurements of progressive 
inhibition of substrate hydrolytic activity by hNE following 
addition of inhibitor. For these experiments, a known 
concentration of inhibitor was added to a solution of hNE 
(0.847 nM) and substrate (25 uM) in 2 ml of reaction buffer 
in a plastic fluorescence cuvette. The change in 
fluorescence was recorded continuously following addition of 
the inhibitor. In these experiments, sample fluorescence 
did not increase linearly with time. Instead, the rate of 
fluorescence steadily decreased reflecting increasing 
inhibition of hNE by the added inhibitor. The enzymatic 
rate at selected times following addition of the inhibitor 
was determined from the slope of the tangent to the 
fluorescence time course at that time. 



The kinetic constant k on for EPI-HNE-2 reacting with hNE 
was determined as follows. EPI-HNE-2 at 1.3 nM was added to 
buffer containing 0.867 nM hNE (I:E = 1.5:1) at time 0. 
Measured percent residual activity was recorded as a 
function of time after addition of inhibitor. A least- 
squares fit program was used to obtain the value of k on = 4.0 
x 10 s M _1 s" 1 . 

The kinetic off rate, k off , is calculated from the 
measured values of K i and k on as : 

k off = K D x k on 
The values from such measurements are included in Table 602. 
The EPI -HNE proteins are small, high affinity, fast acting 
inhibitors of hNE. 

B . Specificity. 

Example 16: Specificity of EPI-HNE proteins 

We attempted to determine inhibition constants for EPI-HNE 
proteins reacting with several serine proteases. The 
results are summarized in Table 605. In all cases except 
chymotrypsin, we were unable to observe any inhibition even 
when 10 to 100 ]iM inhibitor was added to enzyme at 
concentrations in the nM range. In Table 605, our 
calculated values for K ± (for the enzymes other than 
chymotrypsin) are based on the conservative assumption of 
less than 10% inhibition at the highest concentrations of 
inhibitor tested. For chymotrypsin, the K± is about 10 uM 
and is probably not specific. 

C. In Vitro Stability. 

Example 17: Resistance to Oxidative Inactivation . 
Table 620 shows measurements of the susceptibility of EPI- 
HNE proteins to oxidative inactivation as compared with that 
of two other natural protein hNE inhibitors: a 1 Protease 
Inhibitor (API) and Secretory Leucocyte Protease Inhibitor 
(SLPI) . API (10 ]M) , SLPI (8.5 pM) , EPI-HNE-1 (5 ]lM) , EPI- 
HNE-2 (10 nM), EPI-HNE-3 (10 yiM) , and EPI-HNE-4 (10 jjM) were 
exposed to the potent oxidizing agent, Chloramine-T, at the 
indicated oxidant : inhibitor ratios in 50 mM phosphate 
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buffer, pH=7 . 0 for 20 minutes at RT . At the end of the 
incubation period, the oxidation reactions were quenched by 
adding methionine to a final concentration of 4 mM. After a 
further 10 minute incubation, the quenched reactions were 
diluted and assayed for residual inhibitor activity in our 
standard hNE-inhibition assay. 

Both API and SLPI are inactivated by low molar ratios of 
oxidant to inhibitor. The Chloramine-T : protein molar ratios 
required for 50% inhibition of API and SLPI are about 1:1 
and 2:1, respectively. These ratios correspond well with 
the reported presence of two and four readily oxidized 
methionine residues in API and SLPI, respectively. In 
contrast, all four EPI-HNE proteins retain essentially 
complete hNE-inhibition activity following exposure to 
Chloramine-T at all molar ratios tested (up to 50:1, in the 
cases of EPI-HNE-3 and EPI-HNE-4) . Neither EPI-HNE-3 nor 
EPI-HNE-4 contain any methionine residues. In contrast, 
EPI-HNE-1 and EPI-HNE-2 each contains two methionine 
residues (see Table 100) . The resistance of these proteins 
to oxidative inactivation indicates that the methionine 
residues are either inaccessible to the oxidant or are 
located in a region of the protein that does not interact 
with hNE. 

Example 18: pH Stability. 

Table 612 shows the results of measurements of the pH 
stability of EPI-HNE proteins. The stability of the 
proteins to exposure to pH conditions in the range of pH 1 
to pH 10 was assessed by maintaining the inhibitors in 
buffers of defined pH at 37°C for 18 hours and determining 
the residual hNE inhibitory activity in the standard hNE- 
inhibition assay. Proteins were incubated at a 
concentration of 1 uM. The buffers shown in Table 14 were 
formulated as described (STOL90) and used in the pH ranges 
indicated : 
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Table 14: Buffers used in stability studies 


Buffer 


Lowest pH 


Highest pH 


Glycine-HCl 


1 


2.99 


Citrate-Phosphate 


3 


7 


Phosphate 


7 


8 


Glycine-NaOH 


8.5 


10 



Both BPTI-derived inhibitors, EPI-HNE-1 and EPI-HNE-2, 
are stable at all pH values tested. EPI-HNE-3 and EPI-HNE- 
4, the inhibitors derived from the human protein Kunitz-type 
domain, were stable when incubated at low pH, but showed 
some loss of activity at high pH . When incubated at 37°C for 
18 hours at pH= 7.5, the EPI-HNE-3 preparation lost 10 to 
15% of its hNE-inhibition activity. EPI-HNE-4 retains 
almost full activity to pH 8.5. Activity of the ITI-D2- 
derived inhibitor declined sharply at higher pH levels so 
that at pH 10 only 30% of the original activity remained. 
The sensitivity of EPI-HNE-3 to incubation at high pH 
probably explains the loss of activity of the protein in the 
final purification step noted previously. 

Example 19: Temperature Stability. 

The stability of EPI-HNE proteins to temperatures in the 
range 0°C to 95°C was assessed by incubating the inhibitors 
for thirty minutes at various temperatures and determining 
residual inhibitory activity for hNE . In these experiments, 
protein concentrations were 1 \iM in phosphate buffer at 
pH=7 . As is shown in Table 630, the four inhibitors are 
quite temperature stable. 

EPI-HNE-1 and EPI-HNE-2 maintain full activity at all 
temperatures below about 90°C. EPI-HNE-3 and EPI-HNE-4 
maintain full inhibitory activity when incubated at 
temperatures below 65°C . The activity of the protein 
declines somewhat at higher temperatures. However, all 
three proteins retain more than -50% activity even when 
incubated at 95°C for 30 minutes. 



Example 20: ROUTES to OTHER hNE- INHIBITORY SEQUENCES: 
The present invention demonstrates that very high-affinity 
hNE inhibitors can be devised from Kunitz domains of human 
origin with very few amino-acid substitutions. It is 
believed that almost any Kunitz domain can be made into a 
potent and specific hNE inhibitor with eight or fewer 
substitutions. In particular, any one of the known human 
Kunitz domains could be remodeled to provide a highly 
stable, highly potent, and highly selective hNE inhibitor. 
There are at least three routes to hNE inhibitory Kunitz 
domains: 1) replacement of segments known to be involved in 
specifying hNE binding, 2) replacement of single residues 
thought to be important for hNE binding, and 3) use of 
libraries of Kunitz domains to select hNE inhibitors. 

Example 21: Substit ution of Segments in Kunitz Domains 
Table 100 shows the amino-acid sequences of 11 human Kunitz 
domains. These sequences have been broken into ten segments: 
1:N terminus-residue 4; 2 .-residue 5; 3: 6-9 (or 9a); 4:10-13; 
5:14; 6:15-21; 7:22-30, 8:31-36; 8:37-38; 9:39-42; and 
10:43-C terminus (or 42a-C terminus). 

Segments 1, 3, 5, 7, and 9 contain residues that strongly 
influence the binding properties of Kunitz domains and are 
double underscored in the Consensus Kunitz Domain of Table 
100. Other than segment 1, all the segments are the same 
length except for TFPI-2 Domain 2 which carries an extra 
residue in segment 2 and two extra residues in segment 10. 

Segment 1 is at the amino terminus and influences the 
binding by affecting the stability and dynamics of the 
protein. Segments 3, 5, 7, and 9 contain residues that 
contact serine proteases when a Kunitz domain binds in the 
active site. High-affinity hNE inhibition requires a 

molecule that is highly complementary to hNE. Segments 3, 
5, 7, and 9 supply the amino acids that contact the 
protease. The sequences in segments 1, 3, 5, 7, and 9 must 
work together in the context supplied by each other and the 
other segments. Nevertheless, we have demonstrated that very 
many different sequences are capable of high-affinity hNE 



It may be desirable to have an hNE inhibitor that is 
highly similar to a human protein to reduce the chance of 
immunogenic! ty. Candidate high-affinity hNE inhibitor 
protein sequences may be obtained by taking an aprotonin- 
type Kunitz domain that strongly or very strongly inhibits 
hNE, and replacing one, two, three, four or all of segments 
2, 4, 6, 8, and 10 with the corresponding segment from a 
human Kunitz domain, such as those listed in Table 100, or 
other domain known to have relatively low immunogenicity in 
humans. (Each of segments 2, 4, 6, 8, and 10 may be taken 
from the same human domain, or they may be taken from 
different human domains.) Alternatively, a reduced 
immunogenicity, high hNE inhibiting domain may be obtained 
by taking one of the human aprotonin-type Kunitz domains and 
replacing one, two, three or all of segments 3, 5, 7 and 9 
(and preferably also segment 1) with the corresponding 
segment from one or more aprotonin-like Kunitz domains that 
strongly or very strongly inhibit hNE. In making these 
humanized hNE inhibitors, one may, of course, use, rather 
than a segment identical to that of one of the 

aforementioned source proteins, a segment which differs from 
the native source segment by one or more conservative 
modifications. Such differences should, of course, be taken 
with due consideration for their possible effect on 
inhibitory activity and/or immunogenicity. In some cases, 
it may be advantageous that the segment be a hybrid of 
corresponding segments from two or more human domains (in 
the case of segments 2, 4, 6, 8 and 10) or from two or more 
strong or very strong hNE inhibitor domains (in the case of 
segments 3, 5, 7, and 9) . Segment 1 may correspond to the 
segment 1 of a strong or very strong hNE inhibitor, or the 
segment 1 of a human aprotonin-like Kunitz domain, or be a 
chimera of segment l's from both. 

The proteins DPI. 1.1, DPI. 2.1, DPI. 3.1, DPI. 4.1, DPI. 5.1, 
DPI .6.3, DPI. 7.1, DPI. 8.1, and DPI. 9.1 were designed in this 
way. DPI. 1.1 is derived from App-I by replacing segments 3, 
5, 7, and 9 with the corresponding segments from EPI-HNE-1. 
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DPI. 2.1 is derived from TFPI2-D1 by replacing segments 3, 5, 
7, and 9 with the corresponding residues from EPI-HNE-1. 
DPI. 3.1 is derived from TFPI2-D2 by replacing residues 9a-21 
with residues 10-21 of EPI-HNE-4 and replacing residues' 31- 
42b with residues 31-42 of EPI-HNE-4. DPI . 4 . 1 is derived 
from TFPI2-D3 by replacing segments 3, 5, 7, and 9 with the 
corresponding residues from MUTQE . DPI . 5 . 1 is derived from 
LACI-D1 by replacing segments 3, 5, 7, and 9 with the 
corresponding residues from MUTQE. DPI . 6 . 1 is derived from 
LACI-D2 by replacing segments 3, 5, 7 , and 9 with the 
corresponding residues from MUTQE . DPI . 7 . 1 is derived from 
LACI-D3 by replacing segments 3, 5, 7 , 9 with the 
corresponding residues from EPI-HNE-4. DPI . 8 . 1 is derived 
from the A3 collogen Kunitz domain by substitution of 
segments 3, 5, 7, and 9 from EPI-HNE-4. DPI . 9 . 1 is derived 
from the HKI B9 domain by replacing segments 3, 5, 7, and 9 
with the corresponding residues from EPI-HNE-4. 

While the above-described chimera constitute preferred 
embodiments of the present invention, the invention is not 
limited to these chimera. 

Example 22: Point s ubstitutions in Kunitz Domains 

In this example, certain substitution mutations are 
discussed. It must be emphasized that this example 
describes preferred embodiments of the invention, and is not 
intended to limit the invention. 

All of the protein sequences mentioned in this example 
are to be found in Table 100. Designed protease inhibitors 
are designated "DPI" and are derived from human Kunitz 
domains (also listed in Table 100) . Each of the sequences 
designated DPI . i . 2 (for i = 1 to 9) is derived from the 
domain two above it in the table by making minimal point 
mutations. Each of the sequences designated DPI . i . 3 (for 
i = 1 to 9) is derived from the sequence three above it by' 
more extensive mutations intended to increase affinity. For 
some parental domains, additional examples are given. The 
sequences designated DPI.i.l are discussed in Example 21. 
The most important positions are 18 and 15. Any Kunitz 



domain is likely to become a good hNE inhibitor if Val or 
lie is at 15 (with lie being preferred) and Phe is at 18. 
(However, these features are not necessarily required for 
such activity.) 

If a Kunitz domain has Phe at 18 and either lie or Val at 
15 and is not a good hNE inhibitor, there may be one or more 
residues in the interface preventing proper binding. 

The Kunitz domains having very high affinity for hNE 
herein disclosed (as listed in Table 100) have no charged 
groups at residues 10, 12 through 19, 21, and 32 through 42. 
At position 11, only neutral and positively charged groups 
have been observed in very high affinity hNE inhibitors. At 
position 31, only neutral and negatively charged groups have 
been observed in high-affinity hNE inhibitors. If a 
parental Kunitz domain has a charged group at any of those 
positions where only neutral groups have been observed, then 
each of the charged groups is preferably changed to an 
uncharged group picked from the possibilities in Table 790 
as the next step in improving binding to hNE. Similarly, 
negatively charged groups at 11 and 19 and positively 
charged groups at 31 are preferably replaced by groups 
picked from Table 790. 

At position 10, Tyr, Ser, and Val are seen in high- 
affinity hNE inhibitors. Asn or Ala may be allowed since 
this position may not contact hNE. At position 11, Thr, 
Ala, and Arg have been seen in high-affinity hNE inhibitors. 

Gin and Pro are very common at 11 in Kunitz domains and may 
be acceptable. Position 12 is almost always Gly. If 12 is 
not Gly, try changing it to Gly. 

All of the high-affinity hNE inhibitors produced so far 
have Pro 13 , but it has not been shown that this is required. 
Many (62.5%) Kunitz domains have Pro 13 . If 13 is not Pro, 
then changing to Pro may improve the hNE affinity. Val, 
Ala, Leu, or lie may also be acceptable here. 

Position 14 is Cys . It is possible to make domains 
highly similar to Kunitz domains in which the 14-38 
disulfide is omitted. Such domains are likely to be less 
stable than true Kunitz domains having the three standard 
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disulfides . 

Position 15 is preferably lie or Val . lie is more 
preferred . 

Most Kunitz domains (82%) have either Gly or Ala at 16 
5 and this may be quite important. If residue 16 is not Gly 
or Ala, change 16 to either Gly or Ala; Ala is preferred. 
Position 17 in very potent hNE inhibitors has either Phe or 
Met; those having lie or Leu at 17 are less potent. Phe is 
preferred. Met should be used only if resistance to 

10 oxidation is not important. Position 18 is Phe. 

It has been shown that high-affinity hNE inhibitors may 
have either Pro or Ser at position 19. Gin or Lys at 
position 19 may be allowed. At position 21, Tyr and Trp 
have been seen in very high affinity hNE inhibitors; Phe may 

15 also work. 

At position 31, Gin, Glu, and Val have been observed in 
high affinity hNE inhibitors. Since this is on the edge of 
the binding interface, other types are likely to work well. 
One should avoid basic types (Arg and Lys) . At position 32, 

20 Thr and Leu have been observed in high-affinity hNE 

inhibitors. This residue may not make direct contact and 
other uncharged types may work well. Pro is very common 
here. Ser has been seen and is similar to Thr. Ala has 
been seen in natural Kunitz domains and is unlikely to make 

25 any conflict. Position 33 is always Phe in Kunitz domains. 

It appears that many amino acid types may be placed at 
position 34 while retaining high affinity for hNE; large 
hydrophobic residues (Phe, Trp, Tyr) are unfavorable. Val 
and Pro are most preferred at 34. Positions 35-38 contain 

30 the sequence Tyr-Gly-Gly-Cys . There is a little diversity 
at position 36 in natural Kunitz domains. In the BPTI- 
Trypsin complex, changing Gly 36 to Ser greatly reduces the 
binding to trypsin. Nevertheless, S36 or T36 may not 
interfere with binding to hNE and could even improve it. If 

35 residue 36 is not Gly, one should consider changing it to 
Gly. 

Position 39 seems to tolerate a variety of types. Met 
and Gin are known to work in very high-affinity inhibitors. 
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Either Ala or Gly are acceptable at position 40; Gly is 
preferred. At position 41, Asn is by far the most common 
type in natural Kunitz domains and may act to stabilize the 
domains. At position 42, Gly is preferred, but Ala is 
5 allowed. 

Finally, positions that are highly conserved in Kunitz 
domains may be converted to the conserved type if needed. 
For example, the mutations X36G, X37G, X41N, and X12G may be 
desirable in those cases that do not already have these 

10 amino acids at these positions. 

The above mutations are summarized in Table 711. Table 
711 contains, for example, mutations of the form X15I which 
means change the residue at position 15 (whatever it is) to 
lie or leave it alone if it is already lie. A Kunitz domain 

15 that contains the mutation X18F and either X15I or X15V 

(X15I preferred) will have strong affinity for hNE . As from 
one up to about 8 of the mutations found in Table 711 are 
asserted, the affinity of the protein for hNE will increase 
so that the K L approaches the range 1-5 pM . 

20 The sequence DPI. 1.2 was constructed from the sequence of 

App-I by the changes R15I, I18F, and F34V and should be a 
potent hNE inhibitor. DPI . 1 . 3 is likely to be a more potent 
inhibitor, having the changes R15I, M17F (to avoid 
sensitivity to oxidation), I18F, P32T, F34V, and G39M. 

25 DPI. 2. 2 was derived from the sequence of TFPI2-D1 by the 

changes R15I, L18F, and L34V and should be a potent hNE 
inhibitor. DPI. 2. 3 may be more potent due to the changes 
Y11T, R15I, L17F, L18F, R31Q, Q32T, L34V, and E39M. 

DPI. 3. 2 is derived from TFPI2-D2 by the changes E15I, 

30 T18F, S26A(to prevent glycosylation) , K32T, and F34V and 
should be a potent hNE inhibitor. DPI . 3 . 3 may be more 
potent by having the changes A9a, D11A, D12G, Q13P, E15I, 
S17F, T18F, E19K, K20R, N24A (to prevent glycosylation) , 
K32T, F34V, and A42a-42b. 

35 DPI .4.2 is derived from TFPI2-D3 by the changes S15I, 

N17F, and V18F and should be a potent inhibitor of hNE. 
DPI .4.3 may be more potent by having the changes E11T, L13P, 
S15I, N17F, V18F, A32T, T34V, and T36G. 



DPI .5.2 is derived from LACI-D1 by the changes K15I and 
M18F and is likely to be a potent inhibitor of hNE . DPI. 5. 3 
may be more potent due to the changes D10Y, D11T, K15I, 
I17F, M18F, and E32T. Other changes that may improve 
DPI. 5. 3 include F21W, I34V, E39M, and Q42G. 

The sequence of DPI. 6.2 was constructed from the sequence 
of human LACI-D2 by the mutations R15V and I18F. The rest 
of the sequence of LACI-D2 appears to be compatible with hNE 
binding. DPI . 6 . 3 carries two further mutations that make it 
more like the hNE inhibitors here disclosed: Y17F and K34V. 
Other alterations that are likely to improve the hNE binding 
of LACI-D2 include I13P, R32T, and D10S. DPI .6.4 is derived 
from DPI. 6.3 by the additional alteration N25A that will 
prevent glycosylation when the protein is produced in a 
eukaryotic cell. Other substitutions that would prevent 
glycosylation include: N25K, T27A, T27E, N25S, and N25S. 
DPI .6.5 moves further toward the ITT-Dl, ITI-D2, and BPTI 
derivatives that are known to have affinity for hNE in the 
1-5 pM range through the mutations I13P, R15V, Y17F, I18F, 
T19Q, N25A, K34V, and L39Q. In DPI .6.6, the T19Q and N25A 
mutations have been reverted. Thus the protein would be 
glycosylated in yeast or other eukaryotic cells at N 25 . 
DPI. 6.7 carries the alterations I13P, R15I, Y17F, I18F, 
T19P, K34V, and L39Q. 

DPI. 7. 2 is derived from human LACI domain 3 by the 
mutations R15V and E18F. DPI . 7 . 3 carries the mutations 
R15V, N17F, E18F, and T46K. The T46K mutation should 
prevent glycosylation at N 44 . DPI .7.4 carries more mutations 
so that it is much more similar to the known high-affinity 
hNE inhibitors. The mutations are D10V, L13P, R15V, N17F, 
E18F, K34V, S36G, and T46K. DPI . 7 . 5 carries a different 
set of alterations: L13P, R15I, N17F, E18F, N19P, F21W, 
R31Q, P32T, K34V, S36G, and T46K; DPI . 7 . 5 should not be 
glycosylated in eukaryotic cells. 

DPI. 8. 2 is derived from the sequence of the A3 collagen 
Kunitz domain by the changes R15I, D16A, I18F, and W34V and 
is expected to be a potent hNE inhibitor. DPI . 8 . 3 is 
derived from the A3 collagen Kunitz domain by the changes 



T13P, R15I, D16A, I18F, K20R, and W34V. 

DPI. 9.2 is derived from the HKI B9 Kunitz domain by the 
changes Q15I, T16A, and M18F and is expected to be a potent 
hNE inhibitor. DPI. 9.3 may be more potent due to the 
changes Q15I, T16A, M18F, T19P, E31V, and A34V. 

Example 23 : Libraries of Kunitz Domains 

Other Kunitz domains that can potently inhibit hNE may be 
derived from human Kunitz domains either by substituting 
hNE-inhibiting sequences into human domains . or by using the 
methods of US 5,223,409 and related patents. Table 720 
shows a gene that will cause display of human LACI-D2 on M13 
glllp; essentially the same gene could be used to achieve 
display on M13 gVIIIp or other anchor proteins (such as 
bacterial outer-surface proteins (OSPs) ) . Table 725 shows a 
gene to cause display of human LACI Dl . 

Table 730 and Table 735 give variegations of LACI-D1 and 
LACI-D2 respectively. Each of these is divided into 
variegation of residues 10-21 in one segment and residues 
31-42 in another. In each case, the appropriate vgDNA is 
introduced into a vector that displays the parental protein 
and the library of display phage are fractionated for 
binding to immobilized hNE. 
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Table 13: BPTI Homologues (1-19) 
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Table 13, Continued {BPTI Homologues 20-35) 
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20 


21 


22 


23 


24 


25 
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28 
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Table 13, continued 
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Table 13, continued (Homologues 36-40) 
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P 


P 




14 


C 


C 


C 


C 


C 




15 


R 


K 


K 


K 


K 




16 


A 


A 


A 


A 


A 




17 


R 


R 


R 


R 


K 




18 


I 


M 


I 


M 


M 




19 


I 


I 


I 


I 


I 




20 


R 


R 


R 


R 


R 




21 


Y 


Y 


Y 


Y 


Y 


30 


22 


F 


F 


F 


F 


F 




23 


Y 


Y 


Y 


Y 


Y 




24 


N 


N 


N 


N 


N 




25 


A 


A 


A 


A 


A 




26 


K 
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29 
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40 


A 


A 


A 


A 
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41 


K 


K 


K 


K 


K 


42 


R 


S 


R 


R 


S 


43 


N 


N 


N 


N 


N 
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Table 13, continued 



R # 


36 


37 


38 


39 


40 


44 


N 


N 


N 


N 


N 


45 


F 


F 


F 


F 


F 


46 


K 


K 


K 


K 


R 


47 


S 


S 


S 


S 


S 


48 


A 


A 


S 


A 


A 


49 


E 


E 


E 


E 


E 


50 


D 


D 


D 


D 


D 


51 


C 


C 


C 


C 


C 


52 


E 


M 


M 


M 


M 


53 


R 


R 


R 


R 


R 


54 


T 


T 


T 


T 


T 


55 


C 


C 


C 


C 


C 


56 


G 


G 


G 


G 


G 


57 


G 


G 


G 


G 


G 


58 


A 


A 


A 


A 


A 


59 












60 












61 
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Legend to Table 13 

1 BPTI 

2 Engineered BPTI From MARK87 

3 Engineered BPTI From MARK 8 7 
5 4 Bovine Colostrum (DUFT85) 

5 Bovine Serum (DUFT85) 

6 Semisynthetic BPTI, TSCH87 

7 Semisynthetic BPTI, TSCH87 

8 Semisynthetic BPTI, TSCH87 
10 9 Semisynthetic BPTI, TSCH87 

10 Semisynthetic BPTI, TSCH87 

11 Engineered BPTI, AUER87 

12 Dendroaspis polylepis polylepis (Black mamba) venom 
I (DUFT85) 

15 13 Dendroaspis polylepis polylepis (Black Mamba) venom K 

DUFT85) 

14 Hemachatus hemachates (Ringhals Cobra) HHV II 
(DUFT85) 

15 Naia nivea (Cape cobra) NNV II (DUFT85) 

20 16 Vipera russelli (Russel's viper) RVV II (TAKA74) 

17 Red sea turtle egg white (DUFT85) 

18 Snail mucus ( Helix pomania ) (WAGN78) 

19 Dendroaspis anqusticeps (Eastern green mamba) C13 Si 
C3 toxin (DUFT85) 

25 20 Dendroaspis anqusticeps (Eastern Green Mamba) 

C13 S2 C3 toxin (DUFT85) 

21 Dendroaspis polylepis polylepes (Black mamba) B toxin 
(DUFT8 5) 

22 Dendroaspis polylepis polylepes (Black Mamba) E toxin 
30 (DUFT85) 

23 Vipera ammodvtes TI toxin (DUFT85) 

24 Vipera ammodytes CTI toxin (DUFT85) 

25 Bunctarus fasciatus VIII B toxin (DUFT85) 

26 Anemonia sulcata (sea anemone) 5 II (DUFT8 5) 
35 27 Homo sapiens HI-8e "inactive" domain (DUFT85) 

28 Homo sapiens HI-8t "active" domain (DUFT85) 

29 beta bungarotoxin Bl (DUFT85) 

30 beta bungarotoxin B2 (DUFT85) 
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31 Bovine spleen TI II (FIOR85) 

32 Tachypleus tridentatus (Horseshoe crab) hemocyte 
inhibitor (NAKA87) 

33 Bombvx mori (silkworm) SCI-III (SASA84) 
5 34 Bos taurus (inactive) BI-14 

35 Bos taurus (active) BI-8 

36 :Engineered BPTI (KR15, ME52): Auerswald '88, Biol 
Chem Hoppe-Seyler , 369 Supplement, pp27-35. 

37 : Isoaprotinin G-l: Siekmann, Wenzel, Schroder, and 
10 Tschesche '88, Biol Chem Hoppe-Seyler, 369 : 157-163 . 

38 : Isoaprotinin 2: Siekmann, Wenzel, Schroder, and 
Tschesche '88, Biol Chem Hoppe-Seyler, 369 : 157-163 . 

39 : Isoaprotinin G-2 : Siekmann, Wenzel, Schroder, and 
Tschesche '88, Biol Chem Hoppe-Seyler, 369 : 157-163 . 
15 4 0 : Isoaprotinin 1: Siekmann, Wenzel, Schroder, and 

Tschesche '88, Biol Chem Hoppe-Seyler, 369 :157-163. 

Notes : 

a) both beta bungarotoxins have residue 15 deleted. 
20 b) B. mori has an extra residue between C5 and C14; we 

have assigned F and G to residue 9. 

c) all natural proteins have C at 5, 14, 30, 38, 50, & 55. 

d) all homologues have F33 and G37 . 

e) extra C's in bungarotoxins form interchain cystine 
25 bridges 
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Tables 

Table 30: IIIsp :: bpti : :mautreIII ( initial fragment) fusion 
gene. The DNA sequence has SEQ ID NO. 001; Amino-acid 
sequence has SEQ ID NO. 002. The DNA is linear and is shown 
on the lines that do not begin with " ! " . The DNA encoding 
mature III is identical to the DNA found in Ml3mpl8. The 
amino-acid sequence is processed in vivo and disulfide bonds 
form. 



10 ! SEQ ID NO. 002 mkkllfalpl 

i 123456789 10 

SEQ ID NO. 001 5'-gtg aaa aaa tta tta ttc gca att cct tta 

! |< gene III signal peptide 

15 i r cleavage site 



20 



V 


V 


P 


f 


y 


s 


G 


A 






11 


12 


13 


14 


15 


16 


17 


18 






gtt 


gtt 


cct 


ttc 


tat 


tct 


GGc 


Gcc 
->l 












R 


P 


D 


F 


C 


L 


E 








19 


20 


21 


22 


23 


24 


25 








CGT 


CCG 


GAT 


TTC 


TGT 


CTC 


GAG 



25 ! M13/BPTI Jnct T | AccIII | ] Xhol | (& Aval) ! 







P 


P 


Y 


T 


G 


P 


C 


K 


A 


R 






26 


27 


28 


29 


30 


31 


32 


33 


34 


35 






CCA 


CCA 


TAC 


ACT 


GGG 


CCC 


TGC 


AAA 


GCG 


CGC 


30 






PflMI 




1 




1 




BssHII 












J 


Apal 


1 


















J 


Cra 1 1 




PssI 






I 


I 


I 


R 


Y 


F 


Y 


N 


A 


K 


A 


35 




36 


37 


38 


39 


40 


41 


42 


43 


44 


45 






ATC 


ATC 


CGC 


TAT 


TTC 


TAC 


AAT 


GCT 


AAA 


GC 






G 


L 


C 


Q 


T 


F 


V 


Y 


G 


G 






46 


47 


48 


49 


50 


51 


52 


53 


54 


55 


40 


A 


GGC 


CTG 


TGC 


CAG 


ACC 


TTT 


GTA 


TAC 


GGT 


GGT 




i-L 


StuI 1 










Xcal 


( & 


Ace. 




! 


C 


R 


A 


K 


R 


N 


N 


F 


K 






; 


56 


57 


58 


59 


60 


61 


62 


63 


64 




45 




TGC 


CGT 


GCT 


AAG 


CGT 


AAC 


AAC 


TTT 


AAA 






! 






ESDI 


















S 


A 


E 


D 


C 


M 


R 


T 


C 


G 






65 


66 


67 


68 


69 


70 


71 


72 


73 


74 


50 




TCG 


GCC 


GAA 


GAT 


TGC 


ATG 


CGT 


ACC 


TGC 


GGT 



! |XmaIII | | Svhl \ 

I 
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! BPTI/M13 boundary 

! I 

! I G I A I A E (Residue numbers of mature III have had 

! I 75 | 76 I 119 120 118 added to the usual residue 

numbers . ) 

|GGC|GCC|gct gaa- 
! 1 Narl | (& KasI) 

I 121 122 123 124 125 126 127 128 129 130 131 132 133 134 
ITVESCLAKPHTENS... 
act gtt gaa agt tgt tta gca aaa ccc cat aca gaa aat tea. . . 

! 

! The remainder of the gene is identical to the 
corresponding part of iii in M13 mpl8 . 
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Table 35: IIIsp : : itiDl : :matureIII fusion gene. 

DNA has SEQ ID NO. 003; amino-acid sequence has SEQ ID NO. 
004 . 

The DNA is a linear segment and the amino-acid sequence is a 
protein that is processed in vivo and which contains 
disulfides . 



SEQ ID NO. 004 
mkkllfalplvvpf 

y 

-18 -17 -16 -15 -14 -13 -12 -11 -10 -9 -8 -7 -6 -5 

-4 

5 ' -gtg aaa aaa tta tta ttc gca att cct tta gtt gtt cct ttc 
tat 

SEQ ID NO. 003 

|< gene III signal peptide 



r cleavage site 
sGAKEDS CQLGYSA 

G 

-3 -2 -1 1 2 3 4 5 6 7 8 9 10 11 

12 

tct GGc Gcc aaa gaa gaC tcT tGC CAG CTG GGC tac tCG GCC 

Ggt 

>| J Bcrll I | Eagl 

| KasI 1 





13 14 


15 


16 


17 


18 


19 


20 


21 


22 


23 


24 


25 


26 


30 


P C 


M 


G 


M 


T 


S 


R 


Y 


F 


Y 


N 


G 


T 




ccc tgc 


atg 


gga 


atg 


acc 


age 


agg 


tat 


ttc 


tat 


aat 


ggt 


aca 




27 28 


29 


30 


31 


32 


33 


34 


35 


36 


37 


38 


39 


40 




41 


























35 


S M 


A 


C 


E 


T 


F 


Q 


Y 


G 


G 


C 


M 


G 




tCC ATG 


Gcc 


tgt 


gag 


act 


ttc 


cag 


tac 


ggc 


ggc 


tgc 


atg 


ggc 




aac 




























| Ncol 


-L 
























40 


1 Styl 


-L 


























42 43 


44 


45 


46 


47 


48 


49 


50 


51 


52 


53 


54 


55 




56 




























G N 


N 


F 


V 


T 


E 


K 


E 


C 


L 


Q 


T 


C 


45 


ggt aac 


aac 


ttc 


gtc 


aca 


gaa 


aag 


gag 


tgt 


CTG 


CAG 


acc 


tgc 



1 Ps tl 



57 58 101 102 119 120 
TV g a A E 

act gtg ggc gcc get gaa 
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| Bbel | (Residue numbers of mature 

| Narl | III have had 118 added to 
| KasI | the usual residue numbers.) 

121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 
TVESCLAKPHTENSF 

act gtt gaa agt tgt tta gca aaa ccc cat aca gaa aat tea 
ttt . . 



The remainder of the gene is identical to the corresponding 
part of gene iii in phage M13mpl8 . 
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Table 55: Affinity Classes of ITI-D1 -derived hNE inhibitors 



Affinity 
Class 


Estimated 
K D 


Fraction of 

Input 

bound 


pH Elution 
Maximum 


Protein 


WEAK 


K D > 10 nM 


<0.005% 


>6.0 


ITI-D1 


MODERAT 
E 


1 to 10 nM 


0.01% to 
0.03% 


5.5 to 5.0 


BITI 

ITI-D1E7 


STRONG 


10 to 1000 
pM 


0.03% to 
0.06% 


5.0 to 4.5 


BITI-E7 

BITI-E7-1222 

AMINOI 

AMIN02 

MUTP1 


VERY 
STRONG 


< 10 pM 


>0.1% 


<4.0 


BITI-E7-141 
MUTT26A 
MUTQE 
MUT1619 
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Table 65: Definition of Class A, B and C mutations in 
PCT/US92/01501. 



No major effect expected if molecular charge 
stays in range -1 to +1. 

Major effects not expected, but are more 
likely than in "A". 

Residue in the binding interface; any change 

must be tested. 

No substitution allowed. 



Id. EpiNEl Substitutions 

1 R any 

2 P any 

3 D any 

4 F Y, W, L 

5 C C 

6 L non-proline 

7 E L, S, T, D, N, K, R 

8 P any 

9 P any 

10 Y non-proline prefr'd 

11 T any 

12 G must be G 

13 P any 

14 C C strongly preferred, any non-proline 

15 I V, A 

16 A 

17 F L, I, M, Y, W, H, V 

18 F Y, W, H 

19 P any 

20 R non-proline prefr'd 

21 Y F & Y most prefr'd; W, I, L prefr'd; M, 1 

allowed 

22 F Y & F most prefr'd; non-proline prefr'd 

23 Y Y & F strongly prefr'd 

24 N non-proline prefr'd 

25 A any 

26 K any 

27 A any 

28 G non-proline prefr'd 

29 L non-proline prefr'd 

30 C must be C 
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31 


Q 


non—proline prefr'd 




B 




32 


T 


non-proline prefr'd 




B 




33 


F 


F very strongly prefr d, Y 


possxble 






34 


V 


any 




C 


5 


35 


Y 


Y most prefr'd; W prefr'd; 


F allowed 


B 




Res . 












Id. 


EpiNEl 


Substitutions 




Class 


10 


36 


G 


G strongly prefr'd; S, A prefr'd; 


c 
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must be G so long as 38 




x 




38 


c 


C strongly pjrefjr'd 
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any 
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any non-proline 




B 


ni 


47 


ST N A 


G 








48 




any 






O 


49 


E 


any 




A 








any 






:25 


51 


c 


must be C 
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: Sequences of Kunitz domains 
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Sequences listed in Table 100 that strongly inhibit hNE are 
EPI-HNE-1 (=EpiNEl) , EPI-HNE-2, EpiNE7, EpiNE3 , EpiNE6 , 
EpiNE4, EpiNE8, EpiNE5 , EpiNE2 , BITI-E7-141, MUTT26A, MUTQE, 
MUT1619, ITI-D1E7, AMINOl, AMIN02, MUTP1, and EPI-HNE-3, and 
EPI-HNE-4. Sequences listed in Table 100 that are highly 
likely to strongly inhibit hNE are DPI. 1.1, DPI. 1.2, 
DPI. 1.3, DPI. 2.1, DPI. 2. 2, DPI. 2. 3, DPI. 3.1, DPI. 3. 2, 
DPI. 3. 3, DPI. 4.1, DPI. 4. 2, DPI. 4. 3, DPI. 5.1, DPI. 5. 2, 
DPI. 5. 3, DPI. 6.1, DPI. 6.2, DPI. 6.3, DPI. 6.4, DPI. 6.5, 
DPI. 6. 6, DPI. 6.7, DPI. 7.1, DPI. 7. 2, DPI. 7. 3, DPI. 7. 4, 
DPI. 7. 5, DPI. 8.1, DPI. 8. 2, DPI. 8. 3, DPI. 9.1, DPI. 9. 2, and 
DPI. 9. 3. Human Kunitz domains listed in Table 100: ITI-D1, 
ITI-D2, App-I, TFPI2-D1, TFPI2-D2, TFPI2-D3, LACI-D1, LACI- 
D2, LACI-D3, A3 collagen Kunitz domain, and HKI B9 Domain. 



Table 111: Restriction sites in plasmid pHIL-D2 
pHIL-D2, 93-01-02 Ngene = 8157 

Non-cutters 



Aflll Apal 
Bspl20l BsrGI 
Pad Pmll 
SnaBI Spel 
Xmal (Smal) 



AscI Aval Avrll BamHI Bglll 

BssHII BstEII Fsel Mlul Nrul 



RsrII SacII 
Sse8387I 



SexAI Sfil 
Xhol (PaeR7l) 



Sgf I 



Cutters 



Aatll GACGTc 1 5499 

Afllll Acrygt 1 7746 

Agel Accggt 1 1009 

BlpI GCtnagc 1 597 

BspEI (BspMII,AccIII) Tccgga 1 3551 

BspMI gcaggt 1 4140 

Bstll07l GTAtac 1 7975 
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BstBI(AsuII) TTcgaa 


2 


945 




Bsu36l CCtnagg 


1 


1796 




Ecll36l GAGctc 


1 


216 




EcoRI Gaattc 


1 


956 


'5 


Espl (Bpull02I) GCtnagc 


1 


597 




Hpal 


GTTaac 


1 


1845 




Ncol 


Ccatgg 


1 


3339 




Ndel 


CAtatg 


1 


7924 




Nsil 


(PpulOl) ATGCAt 


1 


684 


10 


PflMI CCANNNNntgg 


1 


196 




Pmel 


GTTTaaac 


1 


420 




PstI 


CTGCAg 


1 


6175 


O 


Pvul 


CGATcg 


1 


6049 


%"\ 


Sapl 


gaagagc 


1 


7863 


3.5 


Sad 


GAGCTc 


1 


216 


iff 1 


Sail 


Gtcgac 


1 


2885 


HI 


Seal 


AGTact 


1 


5938 




SphI 


GCATGc 


1 


4436 


: ; 


StuI 


AGGcct 


1 


2968 


20 


Swal 


ATTTaaat 


1 


6532 




Tthllll GACNnngtc 


1 


7999 




Xbal 


Tctaga 


1 


1741 


25 


Xcml 


CCANNNNNnnnntgg 


1 


711 


Aoxl 
Aoxl 
His4 
Aoxl 


5' 1 to about 950 

3' 950 to about 1250 

1700 to about 4200 
3' 4500 to 5400 






30 


bla 


5600 to 6400 







fl ori 6500 to 6900 
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TABLES 207-208 (merged) 
SEQUENCES OF THE EpiNE CLONES IN THE PI REGION 



CLONE 
IDENTIFIERS 


SEQUENCE 




1 
3 


1 
4 


1 1 

5 6 


1 
7 


1 
8 


1 

9 


2 
0 


2 
1 




BPTI (comp. 
only) 


P 

(SEQ 


C 

ID 


K A 
NO: 6) 


R 


I 


I 


R 


Y 


(BPTI ) 




P 


c 


V A 


M 




Q 






EpiNEa 


3, 9, 16, 17, 
18, 19 


P 

(SEQ 


C 

ID 


V G 
NO: 10 ) 


F 


F 


s 


R 


Y 


EpiNE3 


6 


P 

( SEQ 


C 


V G 


F 


F 


Q 


R 


Y 


EpiNE6 


7, 13, 14, 15, 
20 


P 

(SEQ 


C 

ID 


V A 
NO: 9) 


M 


F 


P 


R 


Y 


EpiNE7 


4 


P 

(SEQ 


C 

ID 


V A 
NO: 12) 


I 


F 


P 


R 


Y 


EpiNE4 


8 


P 

(SEQ 


C 

ID 


V A 
NO: 13) 


I 


F 


K 


R 


S 


EpiNE8 


1, 10, 11, 12 


P 

(SEQ 


C 

ID 


I A 
NO: 7) 


F 


F 


P 


R 


Y 


EpiNEl 


5 


P 
(SEQ 


C 

ID 


I A 
NO: 14) 


F 


F 


Q 


R 


Y 


EpiNE5 


2 


P 

(SEQ 


C 

ID 


I A 
NO: 15) 


L 


F 


K 


R 


Y 


EpiNE2 



Note: The DNA sequences encoding these amino acid 
sequences are set forth in 08/133,031, previously incorporated by 
reference . 
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TABLE 212: Fractionation of EpiNE-7 and MA-ITI-D1 phage on hNE beads 



R l5 





EpiNE-7 


MA-ITI-D1 


pfu 


pfu/INPUT 


pfu 


pfu/INPUT 


INPUT 


3.3-10 9 


1.00 


3.4-10 11 


1.00 


Final 

TBS-TWEEN 
Wash 


3.8-1 0 5 


1.2-10" 4 


1.8-10 6 


5.3-1 0" 6 


pH 7.0 


6.2-1 0 5 


1.8-10" 4 


1.6-10 6 


4.7-1 0" 6 


6.0 


1.4-10 6 


4.1-10" 4 


1.0-1 0 s 


2.9-10" 6 


5.5 


9.4-1 0 5 


2.8-1 0- 4 


1.6-1 0 s 


4.7-1 0" 6 


5.0 


9.5-10 5 


2.9-1 0" 4 


3.1-10 5 


9.1-10" 7 


4.5 


1.2-10 6 


3.5-10" 4 


1.2-10 5 


3.5-1 0" 7 


4.0 


1.6-10 6 


4.8-10- 4 


7.2-1 0 4 


2.1-10" 7 


3.5 


9.5-10 5 


2.9-10" 4 


4.9-10 4 


1.4-10" 7 


3.0 


6.6-10 5 


2.0-1 0" 4 


2.9-1 0 4 


8.5-1 0" 8 


2.5 


1.6-10 5 


4.8-1 0" 5 


1.4-10 4 


4.1-10" 8 


2.0 


3.0-10 5 


9.1-10" 5 


1.7-10 4 


5.0-1 0 s 


SUM 


6.4-1 0 6 


3-1 0" 3 


5.7- 10 6 


2-10 5 



* SUM is the total pfu (or fraction of input) obtained from all pH elution fractions 



TABLE 214: Abbreviated fractionation of display phage on hNE beads 





Display phage 


EpiNE-7 


MA-ITI-D1 2 


MA-ITI-D1E7 1 


MA-ITI-D1E7 2 


INPUT 


1.00 

M ft x If) 9 "* 


1.00 
(1 2 x 10 10 


1.00 

fi i -i n9\ 
^o.O X 11) ) 


1.00 

(1.1 x 10 9 ) 


Wash 


6-1 0" 5 


1-10" 5 


2-1 0" 5 


2-10" 5 


pH 7.0 


3-1 0" 4 


1-10" 5 


2-1 0- 5 


4-1 0" 5 


pH 3.5 


3-1 0" 3 


3-1 0" 6 


8-1 0" 5 


8-1 0" 5 


pH 2.0 


1-10" 3 


1-10" 6 


6-10" 6 


2- 10 s 


SUM 


4.3-1 0 3 


1.4-10" 5 


1.1-10-* I 


1.4-10" 4 



SUM is the total fraction of input pfu obtained from all pH elution fractions 
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TABLE 215: Fractionation of EpiNE-7 and MA-ITI-D1E7 phage on hNE beads 





EpiNE-7 


MA-ITI-D1E7 j 


Total pfu 


Fraction of 


Total pfu 


Fraction of 
Input 


INPUT 


1.8-10 9 


1.00 


3.0-1 0 9 


1.00 


pH 7.0 


5.2-1 0 5 


2.9-10 -4 


6.4-1 0 4 


2.1-1 0' 5 


pH 6.0 


6.4-1 0 5 


3.6-1 0" 4 


4.5-1 0 4 


1.5-10- 5 


pH 5.5 


7.8-10 5 


4.3-1 0" 4 


5.0-10 4 


1.7-10' 5 


pH 5.0 


8.4-1 0 5 


4.7-1 0" 4 


5.2-1 0 4 


1.7-10" 5 


pH 4.5 


1.1-10 6 


6.1-10" 4 


4.4-1 0 4 


1.5-10" 5 


pH 4.0 


1.7-10 6 


9.4-1 0" 4 


2.6-1 0 4 


8.7-1 0" 6 


pH 3.5 


1.1-10 6 


6.1-10" 4 


1.3-10 4 


4.3-1 0 6 


pH 3.0 


3.8-1 0 5 


2.1-10" 4 


5.6-1 0 3 


1.9-10" 6 


pH 2.5 


2.8-1 0 5 


1.6-10" 4 


4.9-1 0 3 


1.6-10" 6 


pH 2.0 


2.9-10 5 


1.6-10" 4 


2.2-1 0 3 


7.3-1 0" 7 


SUM 


7.6-10 6 


4.1-10" 3 


3.1-10 5 


1.1-10" 4 



* SUM is the total pfu (or fraction of input) obtained from all pH elution fractions . 
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TABLE 217: Fractionation of MA-BITI-E7 and MA-BITI-E7-1222 on hNE beads 





MA-BITI-E7 


[MA-BITI-E7-1222 


pfu 


pfu/INPUT 


mfu 


pfu/INPUT 


INPUT 


1.3-10" 


1.00 


I 1.2 10 9 


1.00 


pH 7.0 


4.7 10 4 


3.6 10" 5 


4 0 10 4 


3310 — 


6.0 


5.3 10 4 


4.1 10 5 


5 5 10 4 


4.6 10" 5 


5.5 


7.1i0 4 


5.5 10" 5 


5 4 10 4 


4.5 10" 5 


5.0 


9.0 10 4 


6.9 10~ 5 


6.7 1 0 4 


5 6 10" 5 


4.5 


6.2 10 4 


4.8 10" 5 


6.710 4 


5.6 10" 5 


4.0 


3.4 10 4 


2.6 10" 5 


2.710 4 


2.210" 5 


3.5 


1.8-1 0 4 


1.410" 5 


2.3 10 4 


1.9 10" 5 


3.0 


2.510 3 


1.9 10" 6 


6.3 10 3 


5.2 10" 6 . 


2.5 


<1.310 3 


<1.010" 6 


<1.310 3 


<1.010" 6 


2.0 


1.3 10 3 


1.0 10' 6 


1.310 3 


1.0-1O* 


SUM 


3.810 5 


2.9 10 4 


3.4 10 5 


2.8 10" 4 



SUM is the total pfu (or fraction of input) obtained from all pH elution fractions 
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TABLE 218: Fractionation of MA-EpiNE7 and MA-BITI-E7-141 on hNE beads 





MA-EpiNE7 


MA-BITI-E7-141 




pfu 


pfu/INPUT 


pfu 


pfu/INPUT 


INPUT 


6.1 10 8 


1.00 


2.0 10 9 


1.00 


pH 7.0 


5.3 10 4 


8.7 10 s 


4.5 10 5 


2 21 0" 4 


6.0 


9.7 10 4 


1.6 10~ 4 


4.4 10 5 


2 21 0" 4 


5.5 


1.1 10 s 


1.8 10~ 4 


4 41 0 5 




5.0 


1.410 s 


2.3 10" 4 


7 2 10 5 


o.o i u 


4.5 


1.0 10 5 


1.6 10" 4 


1.3 10 6 


6 51 0" 4 


4.0 


2.010 5 


3.3 10" 4 


1.1 10 6 


5.5 10" 4 


3.5 


9.710 4 


1.6 10" 4 


5.9 10 5 


3.0 10- 4 


3.0 


3.810 4 


6.2 10" 5 


2.310 5 


1.2i0" 4 


2.5 


1.3 10 4 


2.1 10" 5 


1.2-1 0 s 


6.0 10" 5 


2.0 


1.610 4 


2.6 10" 5 


1.010 5 


5.0 10 5 


SUM 


8.610 5 


1.4-10* 


5.5 10 6 


2.8 10" 3 


£>um is tne total pfu (or fraction of input) obtained 


rom all pH el 


Jtion Tractions 



TABLE 219: pH Elution Analysis of hNE Binding by BITI-E7-141 Varient Display 
Phage 



3.0 



Displayed protein 


Input 


Fraction of Input 
recovered at pH 


Recovery 


PFU 

(x1 0 9 ) 


pH7.0 


pH3.5 

xlO" 4 


pH2.0 
xlO" 4 


Total 

x10 4 


Relative 


AMINU1 (bbj 


0.96 


0.24 


2.3 


0.35 


2 9 




AMINUz (Ab) 


6.1 


0.57 


2.1 


0 45 


3 1 


0.12 


D|TI C7 loon /[— r— \ 

Dl 1 l-b/- iZZz: (bb) 


1.2 


0.72 


4.0 


0.64 


5.4 


0.21 


EpiNE7 (EE) 


0.72 


0.44 


6.4 


2.2 


9.0 


0.35 




3.9 


1.8 


9.2 


1.2 


12.0 


0.46 


MUT1619 (EE) 


0.78 


0.82 


9.9 


0.84 


12.0 


0.46 


MUTQE (AE) 


4.7 


1.2 


16. 


5.3 


22.0 


0.85 


MUTT26A (EE) 


0.51 


2.5 


19.0 


3.3 


25.0 


0.96 


BITI-E7-141 (AE) 


1.7 


2.2 


18.0 


5.4 


26.0 


1.00 


BITI-E7-141 (EE) 


0.75 


2.1 


21. 


3.2 


26.0 


1.00 



Extended pH elution protocol 
Abbreviated pH elution protocol 

Total fraction of input = Sum of fractions collected at pH 

7.0, pH 3.5, and pH 2.0. 
Total fraction of input recovered divided by total fraction of input 
recovered for BITI-E7-141 



Notes: 
EE 
AE 
Total 

Relative 
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Table 250: Plasmid pHIL-D2 SEQ ID NO. 070 

8157 base pairs. Only one strand is shown, but the DNA 

exists as double -stranded circular DNA in vivo. 

1 2 3 4 5 

1234567890 1234567890 1234567890 1234567890 1234567890 

1 AgATCgCggC CgCgATCTAA CATCCAAAgA CgAAAggTTg AATgAAACCT 

51 TTTTgCCATC CgACATCCAC AggTCCATTC TCACACATAA gTgCCAAACg 

101 CAACAggAgg ggATACACTA gCAgCAgACC gTTgCAAACg CAggACCTCC 

151 ACTCCTCTTC TCCTCAACAC CCACTTTTgC CAT CgAAAAA CCAgCCCAgT 

201 TATTgggCTT gATTggAgCT CgCTCATTCC AATTCCTTCT ATTAggCTAC 

2 51 TAACACCATg ACTTTATTAg CCTgTCTATC CTggCCCCCC TggCgAggTC 

3 01 ATgTTTgTTT ATTTCCgAAT g CAACAAg CT CCgCATTACA CCCgAACATC 

3 51 ACTCCAgATg AgggCTTTCT gAgTgTgggg TCAAATAgTT TCATgTTCCC 

4 01 AAATggCCCA AAACTgACAg TTTAAACgCT gTCTTggAAC CTAATATgAC 
4 51 AAAAgCgTgA TCTCATCCAA g ATgAACTAA gTTTggTTCg TTgAAATgCT 
501 AACggCCAgT TggTCAAAAA gAAACTTCCA AAAgTCgCCA TACCgTTTgT 
551 CTTgTTTggT ATTgATTgAC gAATgCTCAA AAATAATCTC ATTAATgCTT 
6 01 AgCgCAgTCT CTCTATCgCT TCTgAACCCg gTggCACCTg TgCCgAAACg 
651 CAAATggggA AACAACCCgC TTTTTggATg ATTATgCATT gTCCTCCACA 
701 TTgTATgCTT CCAAgATTCT ggTgggAATA CTgCTgATAg CCTAACgTTC 
751 ATgAT CAAAA TTTAACTgTT CTAACCCCTA CTTgACAggC AATATATAAA 
801 CAgAAggAAg CTgCCCTgTC TTAAACCTTT TTTTTTATCA TCATTATTAg 
851 CTTACTTTCA TAATTgCgAC TggTTCCAAT TgACAAgCTT TTgATTTTAA 
9 01 CgACTTTTAA CgACAACTTg AgAAgATCAA AAAACAACTA ATTATTCgAA 

BatBI 

951 ACgAggAATT_CgCCTTAgAC ATgACTgTTC CTCAgTTCAA gTTgggCATT 
EcoRI 

10 01 ACgAgAAgAC CggTCTTgCT AgATTCTAAT CAAgAggATg TCAgAATgCC 
1051 ATTTgCCTgA gAgATgCAgg CTTCATTTTT gATACTTTTT TATTTgTAAC 
1101 CTATATAgTA TAggATTTTT TTTgTCATTT TgTTTCTTCT CgTACgAgCT 
1151 TgCTCCTgAT CAgCCTATCT CgCAgCTgAT gAATATCTTg TggTAggggT 
1201 TTgggAAAAT CATTCgAgTT TgATgTTTTT CTTggTATTT CCCACTCCTC 
1251 TTCAgAgTAC AgAAgATTAA gTgAgAAgTT CgTTTgTgCA AgCTTATCgA 

13 01 TAAgCTTTAA TgCggTAgTT TATCACAgTT AAATTgCTAA CgCAgTCAgg 
1351 CACCgTgTAT gAAAT CTAAC AATgCgCTCA TCgTCATCCT CggCACCgTC 

14 01 ACCCTggATg CTgTAggCAT AggCTTggTT ATgCCggTAC TgCCgggCCT 
1451 CTTgCgggAT ATCgTCCATT CCgACAgCAT CgCCAgTCAC TATggCgTgC 
1501 TgCTAgCgCT ATATgCgTTg ATgCAATTTC TATgCgCACC CgTTCTCggA 
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1551 gCACTgTCCg ACCgCTTTgg CCgCCgCCCA gTCCTgCTCg CTTCgCTACT 
1601 TggAgCCACT ATCgACTACg CgATCATggC gACCACACCC gTCCTgTggA 
1651 TCTATCgAAT CTAAATgTAA gTTAAAATCT CTAAATAATT AAATAAgTCC 
1701 CAgTTTCTCC ATACgAACCT TAACAg CATT gCggTgAgCA TCTAgACCTT 
5 1751 CAACAgCAgC CAgATCCATC ACTgCTTggC CAATATgTTT CAgTCCCTCA 

18 01 ggAgTTACgT CTTgTgAAgT gATgAACTTC TggAAggTTg CAgTgTTAAC 
18 51 TCCgCTgTAT TgACgggCAT AT C CgTACgT TggCAAAgTg TggTTggTAC 
1901 CggAggAgTA ATCTCCACAA CTCTCTggAg AgTAggCACC AACAAACACA 
1951 gATCCAgCgT gTTgTACTTg ATCAACATAA gAAgAAgCAT TCTCgATTTg 
10 20 01 CAggATCAAg TgTTCAggAg CgTACTgATT ggACATTTCC AAAgCCTgCT 

2 051 CgTAggTTgC AACCgATAgg gTTgTAgAgT gTgCAATACA CTTgCgTACA 
2101 ATTTCAACCC TTggCAACTg CACAgCTTgg TTgTgAACAg CATCTTCAAT 
2151 TCTggCAAgC TCCTTgTCTg TCATATCgAC AgCCAACAgA ATCACCTggg 
Q 22 01 AATCAATACC ATgTTCAgCT TgAgCAgAAg gTCTgAggCA ACgAAATCTg 

j4 5 2251 gATCAgCgTA TTTATCAgCA ATAACTAgAA CTTCAgAAgg CCCAgCAggC 

m 23 01 ATgT CAATAC TACACAgggC TgATgTgTCA TTTTgAACCA TCATCTTggC 

fll 23 51 AgCAgTAACg AACTggTTTC CTggACCAAA TATTTTgTCA CACTTAggAA 

q 24 01 CAgTTTCTgT TCCgTAAgCC ATAgCAgCTA CTgCCTgggC gCCTCCTgCT 

N 2451 AgCACgATAC ACTTAgCACC AACCTTgTgg gCAACgTAgA TgACTTCTgg 

|jl0 2501 ggTAAgggTA CCATCCTTCT TAggTggAgA TgCAAAAACA ATTTCTTTgC 

p 2 551 AACCAgCAAC TTTggCAggA ACACCCAgCA TCAgggAAgT ggAAggCAgA 

!U 2601 ATTgCggTTC CACCAggAAT ATAgAggCCA ACTTTCTCAA TAggT CTTgC 

2 651 AAAACgAgAg CAgACTACAC CAgggCAAgT CTCAACTTgC AACgTCTCCg 
2701 TTAgTTgAg C TTCATggAAT TTCCTgACgT TATCTATAgA gAgAT CAATg 

25 2 751 gCTCTCTTAA CgTTATCTgg CAATTgCATA AgTTCCTCTg ggAAAggAgC 

2801 TTCTAACACA ggTgTCTTCA AAgCgACTCC ATCAAACTTg gCAgTTAgTT 
2851 CTAAAAgggC TTTgTCACCA TTTTgACgAA CATTgTCgAC AATTggTTTg 
2901 ACTAATTCCA TAATCTgTTC CgTTTTCTgg ATAggACgAC gAAgggCATC 
2951 TTCAATTTCT TgTgAggAgg CCTTAgAAAC gTCAATTTTg CACAATTCAA 

3 0 3 001 TACgACCTTC AgAAgggACT TCTTTAggTT TggATTCTTC TTTAggTTgT 

3 051 TCCTTggTgT ATCCTggCTT ggCATCTCCT TTCCTTCTAg TgACCTTTAg 
3101 ggACTTCATA TCCAggTTTC TCTCCACCTC gTCCAACgTC ACACCgTACT 
3151 TggCACATCT AACTAATgCA AAATAAAATA AgTCAgCACA TTCCCAggCT 

32 01 ATATCTTCCT TggATTTAgC TTCTgCAAgT TCATCAgCTT CCTCCCTAAT 
3 5 3 251 TTTAgCgTTC AACAAAACTT CgTCgTCAAA TAACCgTTTg gTATAAgAAC 

3 3 01 CTTCTggAgC ATTgCTCTTA CgATCCCACA AggTgCTTCC ATggCTCTAA 

33 51 gACCCTTTgA TTggCCAAAA CAggAAgTgC gTTCCAAgTg ACAgAAACCA 

34 01 ACACCTgTTT gTTCAACCAC AAATTTCAAg CAgTCTCCAT CACAATCCAA 
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34 51 TTCgATACCC AgCAACTTTT gAgTTCgTCC AgATgTAgCA CCTTTATACC 
3501 ACAAACCgTg ACgACgAgAT TggTAgACTC CAgTTTgTgT CCTTATAgCC 
3 551 TCCggAATAg ACTTTTTggA CgAgTACACC AggCCCAACg AgTAATTAgA 
3 601 AgAgTCAgCC ACCAAAgTAg TgAATAgACC ATCggggCgg TCAgTAgTCA 
3651 AAgACgCCAA CAAAATTTCA CTgACAgggA ACTTTTTgAC AT CTT CAgAA 
3 701 AgTTCgTATT CAgTAgT CAA TTgCCgAgCA TCAATAATgg ggATTATACC 
3 751 AgAAgCAACA gTggAAgTCA CATCTAC CAA CTTTgCggTC TCAgAAAAAg 
3 801 CATAAACAgT TCTACTACCg CCATTAgTgA AACTTTTCAA ATCgCCCAgT 
3 851 ggAgAAgAAA AAggCACAgC gATACTAgCA TTAgCgggCA AggATgCAAC 
3901 TTTATCAACC AgggTCCTAT AgATAACCCT AgCgCCTggg ATCATCCTTT 

3 951 ggACAACTCT TTCTgCCAAA TCTAggTCCA AAATCACTTC ATTgATACCA 
40 01 TTATACggAT gACTCAACTT gCACATTAAC TTgAAgCTCA gTCgATTgAg 

4 051 TgAACTTgAT CAggTTgTgC AgCTggTCAg CAgCATAggg AAACACggCT 
4101 TTTCCTACCA AACTCAAggA ATTATCAAAC TCTgCAACAC TTgCgTATgC 
4151 AggTAgCAAg ggAAATgTCA TACTTgAAgT CggACAgTgA gTgTAgTCTT 
4 201 gAgAAATT CT gAAgCCgTAT TTTTATTATC AgTgAgTCAg TCATCAggAg 

42 51 ATCCTCTACg CCggACgCAT CgTggCCggC ATCACCggCg CCACAggTgC 

43 01 ggTTgCTggC gCCTATATCg CCgACATCAC CgATggggAA gATCgggCTC 

43 51 gCCACTTCgg gCTCATgAgC gCTTgTTTCg gCgTgggTAT ggTggCAggC 

44 01 CCCgTggCCg ggggACTgTT gggCgCCATC TCCTTgCATg CACCATTCCT 
44 51 TgCggCggCg gTgCTCAACg gCCTCAACCT ACTACTgggC TgCTTCCTAA 
4 501 TgCAggAgTC gCATAAgggA gAgCgTCgAg TATCTATgAT TggAAgTATg 
4551 ggAATggTgA TACCCgCATT CTTCAgTgTC TTgAggTCTC CTATCAgATT 
4 6 01 ATgCCCAACT AAAgCAACCg gAggAggAgA TTT CATggTA AATTTCTCTg 
46 51 ACTTTTggTC ATCAgTAgAC TCgAACTgTg AgACTAT CT C ggTTATgACA 
4 701 gCAgAAATgT CCTTCTTggA gACAgTAAAT gAAgTCCCAC CAATAAAgAA 
4 751 ATCCTTgTTA TCAggAACAA ACTTCTTgTT TCgAACTTTT TCggTgCCTT 
4801 gAACTATAAA ATgTAgAgTg gATATgTCgg gTAggAATgg AgCgggCAAA 
4851 TgCTTACCTT CTggACCTTC AAgAggTATg TAgggTTTgT AgATACTgAT 
4 901 gCCAACTTCA gTgACAACgT TgCTATTTCg TT CAAAC CAT TCCgAATCCA 
4 951 gAgAAATCAA AgTTgTTTgT CTACTATTgA TCCAAgCCAg TgCggTCTTg 
50 01 AAACTgACAA TAgTgTgCTC gTgTTTTgAg gTCATCTTTg TATgAATAAA 
5051 TCTAgTCTTT gATCTAAATA ATCTTgACgA gCCAAggCgA TAAATACCCA 
5101 AATCTAAAAC TCTTTTAAAA CgTTAAAAgg ACAAgTATgT CTgCCTgTAT 
5151 TAAACCCCAA ATCAgCTCgT AgTCTgATCC TCATCAACTT gAggggCACT 

52 01 ATCTTgTTTT AgAgAAATTT gCggAgATgC gATATCgAgA AAAAggTACg 
5251 CTgATTTTAA ACgTgAAATT TATCTCAAgA TCgCggCCgC gATCTCgAAT 

53 01 AATAACTgTT ATTTTTCAgT gTTCCCgATC TgCgTCTATT TCACAATACC 
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5351 


AACATgAgTC 






gATgATAAgC 












gTCAggTggC 


5 




rprprprp/-irp7\ 7\ 7\ rp 

1 1 1 1 L. ± AAA.1 






TAAATgCTTC 






v~v~gigicgcc 






I— X (—iil„l„L.AgA 






gCACgAgTgg 






gAgTTTTCgC 






TgCTATgTgg 






ggTCgCCgCA 






CACAgAAAAg 




6001 


CTgCCATAAC 






ATCggAggAC 




6101 


TgTAACTCgC 




6151 


ACgACgAgCg 




6201 


AAACTATTAA 




6251 


AgACTggATg 


-2 0 


6301 


TTCCggCTgg 


O 




TCTCgCggTA 






CgTAgTTATC 




6451 


gACAgATCgC 




6501 


gACCAAgTTT 




6551 


ATATTTTgTT 






AAC CAAT Ag g 






CgAgATAggg 




6701 


AgAACgTggA 




6751 


ggCCCACTAC 


3 0 




L.L.g 1 AAAgL-A 






gACggggAAA 




6901 


ggAgCgggCg 




6951 


CAC CAC AC C C 




70 01 


TAggTgAAgA 


35 


7051 


gTTTTCgTTC 




7101 


CTTgAgATCC 




7151 


CCACCgCTAC 




7201 


TTTTCCgAAg 



AgCTTATCgA TgATAAgCTg 
TgTCAAACAT gAgAAATCTT 
TTTTATAggT TAATgTCATg 
ACTTTTCggg gAAATgTgCg 
AC AT T C AAAT ATgTATCCgC 
AATAATATTg AAAAAggAAg 
CTTATTCCCT TTTTTgCggC 
AACgCTggTg AAAgTAAAAg 
gTTACATCgA ACTggATCTC 
CCCgAAgAAC gTTTTCCAAT 
CgCggTATTA TCCCgTgTTg 
TACACTATTC TCAgAATgAC 
CATCTTACgg ATggCATgAC 
CATgAgTgAT AACACTgCgg 
CgAAggAgCT AACCgCTTTT 
CTTgATCgTT gggAACCggA 
TgACACCACg ATgCCTgCAg 
CTggCgAACT ACTTACTCTA 
gAggCggATA AAgTTgCAgg 
CTggTTTATT gCTgATAAAT 
TCATTgCAgC ACTggggCCA 
TACACgACgg ggAgTCAggC 
TgAgATAggT gCCTCACTgA 
ACTCATATAT ACTTTAgATT 
AAAATTCgCg TTAAATTTTT 
CCgAAATCgg CAAAATCCCT 
TTgAgTgTTg TTCCAgTTTg 
CTCCAACgTC AAAgggCgAA 
gTgAAC CATC ACCCTAATCA 
CTAAATCggA ACCCTAAAgg 
gCCggCgAAC gTggCgAgAA 
CTAgggCgCT ggCAAgTgTA 
gCCgCgCTTA ATgCgCCgCT 
TCCTTTTTgA TAATCTCATg 
CACTgAgCgT CAgACCCCgT 
TTTTTTTCTg CgCgTAATCT 
CAgCggTggT TTgTTTgCCg 
gTAACTggCT TCAgCAgAgC 



TCAAACATgA gAATTAATTC 
gAAgACgAAA gggCCTCgTg 
ATAATAATgg TTTCTTAgAC 
CggAACCCCT ATTTgTTTAT 
TCATgAgACA ATAACCCTgA 
AgTATgAgTA TTCAACATTT 
ATTTTgCCTT CCTgTTTTTg 
ATgCTgAAgA TCAgTTgggT 
AACAgCggTA AgATCCTTgA 
gATgAgCACT TTTAAAgTTC 
ACgCCgggCA AgAgCAACTC 
TTggTTgAgT ACTCACCAgT 
AgTAAgAgAA TTATgCAgTg 
CCAACTTACT TCTgACAACg 
TTgCACAACA TgggggATCA 
gCTgAATgAA gCCATACCAA 
CAATgg CAAC AACgTTgCgC 
gCTTCCCggC AACAATTAAT 
ACCACTTCTg CgCTCggCCC 
CTggAgCCgg TgAgCgTggg 
gATg gTAAgC CCTCCCgTAT 
AACTATggAT gAACgAAATA 
TTAAgCATTg gTAACTgTCA 
gATTTAAATT gTAAACgTTA 
gTTAAAT CAg CTCATTTTTT 
TAT AAAT C AA AAgAATAgAC 
gAACAAgAgT C CAC TATTAA 
AAACCgTCTA TCAgggCgAT 
AgTTTTTTgg ggTCgAggTg 
gAgCCCCCgA TTTAgAgCTT 
AggAAgggAA gAAAgCgAAA 
gCggTCACgC TgCgCgTAAC 
ACAgggCgCg TAAAAggATC 
ACCAAAATCC CTTAACgTgA 
AgAAAAgATC AAAggATCTT 
gCTgCTTgCA AACAAAAAAA 
gATCAAgAgC TACCAACTCT 
gCAgATACCA AATACTgTCC 
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TTCTAgTgTA 


gCCgTAgTTA 






LL1ALA1ALL 


TCgCTCTgCT 






CgATAAgT Cg 


TgTCTTACCg 




7401 


AggCgCAgCg 


gTCgggCTgA 




7451 


gAgCgAACgA 


CCTACACCgA 




7501 


AAgCgCCACg 


CTTCCCgAAg 






gCAgggTCgg 


AACAggAgAg 




7601 


TggTATCTTT 


ATAgTCCTgT 






ATTTTTgTgA 


TgCTCgTCAg 




7701 


ACgCggCCTT 


TTTACggTTC 




7751 


TTCTTTCCTg 


CgTTATCCCC 




7801 


TgAgTgAgCT 


gATACCgCTC 




51 


CAgTgAgCgA 


ggAAgCggAA 


5 


7901 


CATCTgTgCg 


gTATTTCACA 


. . ,1 5 


7951 


CTCTgATgCC 


gCATAgTTAA 


ru 


8001 


CTgggTCATg 


gCTgCgCCCC 


rii 


8051 


gACgggCTTg 


TCTgCTCCCg 


;j 


8101 


TCCgggAgCT 


gCATgTgTCA 




8151 


gAggCAg 





ggCCACCACT T CAAgAACT C TgTAgCACCg 
AATCCTgTTA CCAgTggCTg CTgCCAgTgg 
ggTTggACTC AAgACgATAg TTAC CggATA 
ACggggggTT CgTgCACACA gCCCAgCTTg 
ACTgAgATAC CTACAgCgTg AgCATTgAgA 
ggAgAAAggC ggACAg gTAT CCggTAAgCg 
CgCACgAggg AgCTTCCAgg gggAAACgCC 
CgggTTTCgC CACCTCTgAC TTgAgCgTCg 
gggggCggAg CCTATggAAA AACgCCAgCA 
CTggCCTTTT gCTggCCTTT TgCTCACATg 
TgATTCTgTg gATAACCgTA TTACCgCCTT 
gCCgCAgCCg AACgACCgAg CgCAgCgAgT 
gAgCgCCTgA TgCggTATTT TCTCCTTACg 
CCgCATATgg TgCACTCTCA gTACAATCTg 
gCCAgTATAC ACTCCgCTAT CgCTACgTgA 
gACACCCgCC AACACCCgCT gACgCgCCCT 
gCATCCgCTT ACAgACAAgC TgTgACCgTC 
gAggTTTTCA CCgTCATCAC CgAAACgCgC 



95 



Table 251: pHIL-D2(MF«PrePro::EPI-HNE-3) 8584 b.p. 

DNA has SEQ ID NO. 071 ; Encoded polypeptide has SEQ ID NO. 072. DNA is circular 
and double stranded, only one strand is shown. Translation of the protein to be expressed 
is shown. 

1 2 3 4 5 

1234567890 1234567890 1234567890 1234567890 1234567890 
1 AgATCgCggC CgCgATCTAA CATCCAAAgA CgAAAggTTg AATgAAACCT 
51 TTTTgCCATC CgACATCCAC AggTCCATTC TCACACATAA gTgCCAAACg 
101 CAACAggAgg ggATACACTA gCAgCAgACC gTTgCAAACg CAggACCTCC 
151 ACTCCTCTTC TCCTCAACAC CCACTTTTgC CAT CgAAAAA CCAgCCCAgT 

2 01 TATTgggCTT gATTggAgCT CgCTCATTCC AATTCCTTCT ATTAggCTAC 
251 TAACAC CATg ACTTTATTAg CCTgTCTATC CTggCCCCCC TggCgAggTC 

3 01 ATgTTTgTTT ATTTCCgAAT gCAACAAgCT CCgCATTACA CCCgAACATC 

3 51 ACTCCAgATg AgggCTTTCT gAgTgTgggg TCAAATAgTT TCATgTTCCC 

4 01 AAATggCCCA AAACTgACAg TTTAAACgCT gTCTTggAAC CTAATATgAC 
451 AAAAgCgTgA TCTCATCCAA gATgAACTAA gTTTggTTCg .TTgAAATgCT 

5 01 AACggCCAgT TggTCAAAAA gAAACTTCCA AAAgTCgCCA TAC CgTTTgT 
551 CTTgTTTggT ATTgATTgAC gAATgCTCAA AAATAATCTC ATTAATgCTT 
601 AgCgCAgTCT CTCTATCgCT TCTgAACCCg gTggCACCTg TgCCgAAACg 
651 CAAATggggA AACAACCCgC TTTTTggATg ATTATg CATT gTCCTCCACA 
701 TTgTATgCTT CCAAgATTCT ggTgggAATA CTgCTgATAg CCTAACgTTC 
751 ATgATCAAAA TTTAACTgTT CTAACCCCTA CTTgAC Agg C AATATATAAA 
8 01 CAgAAggAAg CTgCCCTgTC TTAAACCTTT TTTTTTATCA TCATTATTAg 
851 CTTACTTTCA TAATTgCgAC TggTTCCAAT TgACAAgCTT TTgATTTTAA 
901 CgACTTTTAA CgACAACTTg AgAAg AT C AA AAAACAACTA ATTATTCgAA 

1 BstBI 
ACg 

^ MRFPSIFTAVLFA 

ATg AgA__TTC CC A TC T ATC TTC ACT gCT gTT TTg TTC gCT 
I | BsaBI | 

! 

!AS SALAAPVNTTTE 
27 

gCT TCC TCT gCT TTg gCT gCT CCA gTT AAC ACC ACT ACT gAA 
! Bpml Hpal Bbsl 

!D ETAQIPAEAVIGY 
41 

gAC gAg ACT gCT CAA ATT CCT gCT gAg gCT gTC ATC ggT TAC 
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! S 
55 


D 


L 


E 


G 


D 


F 


D V 


A 


V L P 


F 




TCT 

! 


gAC 


TTg 


gAA 


ggT 


gAC 


TTC 


qAC gTC 
Aatll 


gCT 


gTT TTg CCA 


TTC 




! S 

69 


N 


s 


T 


N 


N 


G 


L L 


F 


INT 


T 


10 


TCT 


AAC 


TCT 


ACT 


AAC 


AAC 


ggT 


TTg TTg 


TTC 


ATC AAC ACT 


ACC 


! I 

83 


A 


S 


I 


A 


A 


K 


E E 


G 


V S L 






ATC 

! 


gCT 


TCT 


ATC 


gCT 


gCT 


AAg 


gAg gAA ggT 


gTT TCC TTg 


gAC 


15 


! K 
91 


R 




A 


A 


C 


N 


L P 










AAg 


AgA 


i — 


gCT 


gCT 

Site 


TgT 

: Of 


AAC TTg CCA 
cleavage 
























%0 


! I 
105 


V 


R 


G 


P 


C 


I 


A F 


F 


P R W 


A 




ATC 


gTC 


AgA 


ggT 


CCA 


TqC 


ATT 


gCT TTC 


TTC 


CCA AgA Tgg 


gCT 










Nail 


is 


! F 
119 


D 


A 


V 


K 


G 


K 


C V 


L 


F P Y 


G 




TTC 

i 
i 


gAC 


gCT 


gTT 


AAg 


ggT 


AAg 


TgC gTC 


TTg 


TTC CCA TAC 
| PflMX 


ggT 


ho 

3 


! G 
133 


C 


Q 


G 


N 


G 


N 


K F 


Y 


S E K 


E 


il 


qqT TgT 
! PflVLl 


CAA 


ggT 


AAC 


ggT AAC 


AAg TTC 


TAC 


TCT gAg AAg gAg 


35 


I C 
141 


R 


E 


Y 


C 


G 


V 


P 










TgT 

! 


AgA gAg 


TAC 


TgT 


ggT 


gTT 


CCA TAg 


TAA 


qAATTCcrCCT 
EcoRI 





TAgACATg 

14 01 ACTgTTCCTC AgTTCAAgTT gggCATTACg AgAAgACCgg TCTTgCTAgA 
14 51 TTCTAATCAA gAggATgTCA gAATgCCATT TgCCTgAgAg ATgCAggCTT 
1501 CATTTTTgAT ACTTTTTTAT TTgTAACCTA TATAgTATAg gATTTTTTTT 
1551 gTCATTTTgT TTCTTCTCgT ACgAgCTTgC TCCTgATCAg CCTATCTCgC 
16 01 AgCTgATgAA TATCTTgTgg TAggggTTTg ggAAAAT CAT TCgAgTTTgA 
1651 TgTTTTTCTT ggTATTTCCC ACTCCTCTTC AgAgTACAgA AgATTAAgTg 
1701 AgAAgTTCgT TTgTgCAAgC TTATCgATAA gCTTTAATgC ggTAgTTTAT 
1751 CACAgTTAAA TTgCTAACgC AgTCAggCAC CgTgTATgAA AT CTAACAAT 
18 01 gCgCTCATCg TCATCCTCgg CACCgTCACC CTggATgCTg TAggCATAgg 
1851 CTTggTTATg CCggTACTgC CgggCCTCTT gCgggATATC gTCCATTCCg 
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19 01 ACAgCATCgC CAgTCACTAT ggCgTgCTgC TAgCgCTATA TgCgTTgATg 
1951 CAATTTCTAT gCgCACCCgT TCTCggAgCA CTgTCCgACC gCTTTggCCg 
2 001 CCgCCCAgTC CTgCTCgCTT CgCTACTTgg AgCCACTATC gACTACgCgA 
2051 TCATggCgAC CACACCCgTC CTgTggATCT ATCgAATCTA AATgTAAgTT 
2101 AAAATCTCTA AATAATTAAA TAAgTCCCAg TTTCTCCATA CgAACCTTAA 
2151 CAgCATTgCg gTgAgCATCT AgACCTTCAA CAgCAgCCAg ATCCATCACT 

22 01 gCTTggCCAA TATgTTTCAg TCCCTCAggA gTTACgTCTT gTgAAgTgAT 
2251 gAACTTCTgg AAggTTgCAg TgTTAACTCC gCTgTATTgA Cggg CAT AT C 

23 01 CgTACgTTgg CAAAgTgTgg TTggTACCgg AggAgTAATC TCCACAACTC 

23 51 TCTggAgAgT AggCACCAAC AAACACAgAT CCAgCgTgTT gTACTTgATC 

24 01 AACATAAgAA gAAgCATTCT CgATTTgCAg gAT CAAgTgT TCAggAgCgT 
2451 ACTgATTggA CATTTCCAAA gCCTgCTCgT AggTTgCAAC CgATAgggTT 
2 5 01 gTAgAgTgTg CAATACACTT gCgTACAATT TCAACCCTTg gCAACTgCAC 
2 551 AgCTTggTTg TgAACAgCAT CTTCAATTCT ggCAAgCTCC TTgTCTgTCA 
2 6 01 TATCgACAgC CAACAgAATC ACCTgggAAT CAATACCATg TTCAgCTTgA 
2 651 gCAgAAggTC TgAggCAACg AAATCTggAT CAgCgTATTT ATCAgCAATA 
2701 ACTAgAACTT CAgAAggCCC AgCAggCATg TCAATACTAC ACAgggCTgA 
2 751 TgTgTCATTT TgAACCATCA TCTTggCAgC AgTAACgAAC TggTTTCCTg 
2 8 01 gACCAAATAT TTTgT CACAC TTAggAACAg TTTCTgTTCC gTAAgCCATA 
2 851 gCAgCTACTg CCTgggCgCC TCCTgCTAgC ACgATACACT TAgCACCAAC 
2901 CTTgTgggCA ACgTAgATgA CTTCTggggT AAgggTACCA TCCTTCTTAg 

2 951 gTggAgATgC AAAAACAATT TCTTTgCAAC CAg CAACTTT ggCAggAACA 

3 0 01 CCCAgCATCA gggAAgTggA AggCAgAATT gCggTTCCAC CAggAATATA 
3 051 gAggCCAACT TTCTCAATAg gTCTTgCAAA ACgAgAgCAg ACTACAC CAg 
3101 ggCAAgTCTC AACTTgCAAC gTCTCCgTTA gTTgAgCTTC ATggAATTTC 
3151 CTgACgTTAT CTATAgAgAg ATCAATggCT CTCTTAACgT TATCTggCAA 

32 01 TTgCATAAgT TCCTCTgggA AAggAgCTTC TAACACAggT gTCTTCAAAg 
3 251 CgACTCCATC AAACTTggCA gTTAgTTCTA AAAgggCTTT gTCACCATTT 
3 3 01 TgACgAACAT TgTCgACAAT TggTTTgACT AATTCCATAA TCTgTTCCgT 

33 51 TTTCTggATA ggACgACgAA gggCATCTTC AATTTCTTgT gAggAggCCT 

34 01 TAgAAACgT C AATTTTgCAC AATT CAATAC gACCTTCAgA AgggACTTCT 
34 51 TTAggTTTgg ATTCTTCTTT AggTTgTTCC TTggTgTATC CTggCTTggC 
3 501 ATCTCCTTTC CTTCTAgTgA CCTTTAgggA CTTCATATCC AggTTTCTCT 
3 551 CCACCTCgTC CAACgTCACA CCgTACTTgg CACATCTAAC TAATgCAAAA 
3601 TAAAATAAgT CAgCACATTC CCAggCTATA TCTTCCTTgg ATTTAgCTTC 
3 651 TgCAAgTTCA TCAgCTTCCT CCCTAATTTT AgCgTTCAAC AAAACTTCgT 
3 701 CgTCAAATAA CCgTTTggTA TAAgAACCTT CTggAgCATT gCTCTTACgA 
3 751 TCCCACAAgg TgCTTCCATg gCTCTAAgAC CCTTTgATTg gCCAAAACAg 
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3801 


gAAgTgCgTT 


CCAAgTgACA 




3851 


TTTCAAgCAg 


TCTCCATCAC 




3901 


TTCgTCCAgA 


TgTAgCACCT 




3951 


TAgACTCCAg 


TTTgTgTCCT 


5 


4001 


gTACACCAgg 


CCCAACgAgT 




4051 


ATAgACCATC 


ggggcggTCA 




4101 


ACAgggAACT 


TTTTgACATC 




4151 


CCgAgCATCA 


ATAATggg g A 




4201 


CTACCAACTT 


TgCggTCTCA 


10 


4251 


TTAgTgAAAC 


TTTTCAAATC 




4301 


ACTAgCATTA 


gCgggCAAgg 




4351 


TAAC C CTAgC 


gCCTgggATC 


2 


4401 


AggTCCAAAA 


TCACTTCATT 


ill 


4451 


CATTAACTTg 


AAgCTCAgTC 


15 


4501 


TggTCAgCAg 


CATAgggAAA 




4551 


ATCAAACTCT 


gCAACACTTg 




4601 


TTgAAgTCgg 


ACAgTgAgTg 




4651 


TATTATCAgT 


gAgTCAgTCA 




4701 


ggCCggCATC 


ACCggCgCCA 




4751 


ACATCACCgA 


TggggAAgAT 


3 


4801 


TgTTTCggCg 


TgggTATggT 


in 


4851 


CgCCATCTCC 


TTgCATgCAC 




4901 


TCAACCTACT 


ACTgggCTgC 




4951 


CgTCgAgTAT 


CTATgATTgg 


25 


5001 


CAgTgTCTTg 


AggTCTCCTA 




5051 


gAggAgATTT 


CATggTAAAT 




5101 


AACTgTgAgA 


CTATCTCggT 




5151 


AgTAAATgAA 


gTCCCACCAA 




5201 


TCTTcrTTTCa 


AACTTTTTCg 


30 




BstBI 




5251 


ATgTCgggTA 


ggAATggAgC 




5301 


AggTATgTAg ggTTTgTAgA 




5351 


TATTTCgTTC AAACCATTCC 


35 


5401 


CTATTgATCC AAgCCAgTgC 


5451 


TTTTgAggTC 


ATCTTTgTAT 




5501 


TTgACgAgCC AAggCgATAA 




5551 


TAAAAggACA AgTATgTCTg 
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gAAACCAACA CCTgTTTgTT CAACCACAAA 
AATCCAATTC g AT AC C C Ag C AACTTTTgAg 
TTATACCACA AAC CgTgACg ACgAgATTgg 
TATAgCCTCC ggAATAgACT TTTTggACgA 
AATTAgAAgA gTCAgCCACC AAAgTAgTgA 
gTAgTCAAAg ACgCCAACAA AATTTCACTg 
TTCAgAAAgT TCgTATTCAg TAgTCAATTg 
TTATACCAgA AgCAACAgTg gAAgT CACAT 
g AAAAAg CAT AAACAgTTCT ACTACCgCCA 
gCCCAgTggA gAAgAAAAAg gCACAgCgAT 
ATgCAACTTT ATCAACCAgg gTCCTATAgA 
ATCCTTTggA CAACTCTTTC TgCCAAATCT 
gATACCATTA TACggATgAC TCAACTTgCA 
gATTgAgTgA ACTTgATCAg gTTgTgCAgC 
CACggCTTTT CCTACCAAAC TCAAggAATT 
CgTATgCAgg TAgCAAgggA AATgTCATAC 
TAgTCTTgAg AAATTCTgAA gCCgTATTTT 
T C AggAg AT C CTCTACgCCg gACgCATCgT 
CAggTgCggT TgCTggCgCC TATATCgCCg 
CgggCTCgCC ACTTCgggCT CATgAgCgCT 
ggCAggCCCC gTggCCgggg gACTgTTggg 
CATTCCTTgC ggCggCggTg CTCAACggCC 
TTCCTAATgC AggAgTCgCA TAAgggAgAg 
AAgTATgggA ATggTgATAC CCgCATTCTT 
TCAgATTATg CCCAACTAAA gCAACCggAg 
TTCTCTgACT TTTggTCATC AgTAgACTCg 
T ATgACAg C A gAAATgTCCT TCTTggAgAC 
TAAAgAAATC CTTgTTATCA ggAACAAACT 
gTgCCTTgAA CTATAAAATg TAgAgTggAT 

gggCAAATgC TTACCTTCTg gACCTTCAAg 
TACTgATgCC AACTTCAgTg ACAACgTTgC 
gAATCCAgAg AAATCAAAgT TgTTTgTCTA 
ggTCTTgAAA CTgACAATAg TgTgCTCgTg 

gAATAAATCT AgTCTTTgAT CTAAATAATC 
AT AC C C AAAT CTAAAACTCT TTTAAAACgT 
CCTgTATTAA ACCCCAAATC AgCTCgTAgT 
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5601 CTgATCCTCA TCAACTTgAg 
5651 gAgATgCgAT AT CgAgAAAA 
57 01 CTCAAgATCg CggCCgCgAT 
5751 CCCgATCTgC gTCTATTTCA 
5 58 01 TAAgCTgTCA AACATgAgAA 

5851 AAATCTTgAA gACgAAAggg 
5901 TgTCATgATA ATAATggTTT 





5951 


ATgTgCgCgg AACCCCTATT 


10 


6001 


TATCCgCTCA TgAgACAATA 




6051 


AAggAAgAgT 


ATgAgTATTC 


!ir 


6101 


TTgCggCATT 


TTgCCTTCCT 


3 


6151 


gTAAAAgATg 


CTgAAgATCA 


m 


6201 


ggATCTCAAC AgCggTAAgA 


ru 


6251 


TTCCAATgAT 


gAgCACTTTT 


6301 


CgTgTTgACg 


CCgggCAAgA 


ry 


6351 


gAATgACTTg 


gTTgAgTACT 


cs 


6401 


gCATgACAgT AAgAgAATTA 




6451 


ACTgCggCCA 


ACTTACTTCT 


fi 

m 
. 


6501 


CgCTTTTTTg 


CACAACATgg 


6551 


AACCggAgCT 


gAATgAAgCC 




6601 


CCTgCAgCAA TggCAACAAC 




6651 


TACTCTAgCT 


TCCCggCAAC 




6701 


TTgCAggACC 


ACTTCTgCgC 


25 


6751 


gATAAATCTg 


gAgCCggTgA 




6801 


ggggCCAgAT 


ggTAAgCCCT 




6851 


gTCAggCAAC 


TATggATgAA 




6901 


TCACTgATTA AgCATTggTA 




6951 


TTAgATTgAT 


TT7AAATTgTA 


30 


7001 


AATTTTTgTT AAATCAgCTC 




7051 


AATCCCTTAT 


AAATCAAAAg 




7101 


CAgTTTggAA 


CAAgAgTCCA 




7151 


gggCgAAAAA 


CCgTCTATCA 




7201 


CTAATCAAgT 


TTTTTggggT 


35 


7251 


CTAAAgggAg 


CCCCCgATTT 




7301 


gCgAgAAAgg 


AAgggAAgAA 




7351 


AAgTgTAgCg 


gTCACgCTgC 




7401 


CgCCgCTACA 


gggCgCgTAA 



gggCACTATC TTgTTTTAgA gAAATTTgCg 
AggTACgCTg ATTTTAAACg TgAAATTTAT 
CTCgAATAAT AACTgTTATT TTTCAgTgTT 
CAATACCAAC ATgAgTCAgC TTATCgATgA 
TTAATTCgAT gATAAgCTgT CAAACATgAg 
CCTCgTgATA CgCCTATTTT TATAggTTAA 
CTTAgACgTC AggTggCACT TTT CggggAA 
Aatll 

TgTTTATTTT TCTAAATACA TTCAAATATg 
ACCCTgATAA ATgCTTCAAT AATATTgAAA 
AACATTTCCg TgTCgCCCTT ATTCCCTTTT 
gTTTTTgCTC ACCCAgAAAC gCTggTgAAA 
gTTgggTgCA CgAgTgggTT ACATCgAACT 
TCCTTgAgAg TTTTCgCCCC gAAgAACgTT 
AAAgTTCTgC TATgTggCgC ggTATTATCC 
gCAACT CggT CgCCgCATAC ACTATTCTCA 
CACCAgTCAC AgAAAAgCAT CTTACggATg 
TgCAgTgCTg CCATAAC CAT gAgTgATAAC 
gACAACgATC ggAggACCgA AggAgCTAAC 
gggATCATgT AACTCgCCTT gATCgTTggg 
ATACCAAACg ACgAgCgTgA CACCACgATg 
gTTgCgCAAA CTATTAACTg gCgAACTACT 
AATTAATAgA CTggATggAg gCggATAAAg 
TCggCCCTTC CggCTggCTg gTTTATTgCT 
gCgTgggTCT CgCggTATCA TTgCAgCACT 
CCCgTATCgT AgTTATCTAC ACgACggggA 
CgAAATAgAC AgATCgCTgA gATAggTgCC 
ACTgTCAgAC CAAgTTTACT CATATATACT 
7AACgTTAATA TTTTgTTAAA ATTCgCgTTA 
ATTTTTTAAC CAATAggCCg AAATCggCAA 
AATAgACCgA gATAgggTTg AgTgTTgTTC 
CTATTAAAgA ACgTggACTC CAACgTCAAA 
gggCgATggC CCACTACgTg AACCATCACC 
CgAggTgCCg TAAAgCACTA AATCggAACC 
AgAgCTTgAC ggggAAAgCC ggCgAACgTg 
AgCgAAAggA gCgggCgCTA gggCgCTggC 
g CgTAAC CAC CACACCCgCC gCgCTTAATg 
AAggATCTAg gTgAAg AT C C TTTTTgATAA 
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7451 
7501 
7551 
7601 
7651 
7701 
7751 
7801 
7851 
7901 
7951 
8001 
8051 
8101 
8151 
8201 
8251 
8301 
8351 
8401 
8451 
8501 
8551 



TCTCATgACC 
ACCCCgTAgA 
gTAATCTgCT 
TTTgCCggAT 
gCAgAgCgCA 
CACCACTTCA 
CCTgTTACCA 
TggACTCAAg 
gggggTTCgT 
gAgATACCTA 
gAAAggCggA 
ACgAgggAgC 
gTTTCgCCAC 
ggCggAgCCT 
gCCTTTTgCT 
TTCTgTggAT 
gCAgCCgAAC 
CgCCTgATgC 
CATATggTgC 
AgTATACACT 
ACCCgCCAAC 
TCCgCTTACA 
gTTTTCACCg 



AAAATCCCTT 
AAAgATCAAA 
gCTTg CAAAC 
CAAgAgCTAC 
gATACCAAAT 
AgAACTCTgT 
gTggCTgCTg 
ACgATAgTTA 
gCACACAgCC 
CAgCgTgAgC 
CAggTATCCg 
TTCCAggggg 
CTCTgACTTg 
ATggAAAAAC 
ggCCTTTTgC 
AAC CgTATTA 
gACCgAgCgC 
ggTATTTTCT 
ACTCTCAgTA 
CCgCTATCgC 
ACCCgCTgAC 
gACAAgCTgT 
TCATCACCgA 



AACgTgAgTT 
ggATCTTCTT 
AAAAAAACCA 
CAACTCTTTT 
ACTgTCCTTC 
AgCACCgCCT 
CCAgTggCgA 
CCggATAAgg 
CAgCTTggAg 
ATTgAgAAAg 
gTAAgCggCA 
AAACgCCTgg 
AgCgTCgATT 
gCCAgCAACg 
TCACATgTTC 
CCgCCTTTgA 
AgCgAgTCAg 
CCTTACgCAT 
CAATCTgCTC 
TACgTgACTg 
gCgCCCTgAC 
gACCgTCTCC 
AACgCgCgAg 



TTCgTTCCAC 
gAgATCCTTT 
CCgCTACCAg 
TCCgAAggTA 
TAgTgTAgCC 
ACATACCTCg 
TAAgTCgTgT 
CgCAgCggTC 
CgAACgACCT 
CgCCACgCTT 
gggTCggAAC 
TATCTTTATA 
TTTgTgATgC 
CggCCTTTTT 
TTTCCTgCgT 
gTgAgCTgAT 
TgAgCgAggA 
CTgTgCggTA 
TgATgCCgCA 
ggTCATggCT 
gggCTTgTCT 
gggAgCTgCA 
gCAg 



TgAgCgTCAg 
TTTTCTgCgC 
CggTggTTTg 
ACTggCTTCA 
gTAgTTAggC 
CTCTgCTAAT 
CTTAC CgggT 
gggCTgAACg 
ACACCgAACT 
CCCgAAgggA 
AggAgAgCgC 
gTCCTgTCgg 
TCgTCAgggg 
ACggTTCCTg 
TATCCCCTgA 
ACCgCTCgCC 
AgCggAAgAg 
TTTCACACCg 
TAgTTAAgCC 
gCgCCCCgAC 
gCTCCCggCA 
TgTgTCAgAg 



Restriction map of pHIL-D2 (MFaPrePro : : EPI -HNE-3 ) 



Non-cutters 

Aflll Apal 
BanMI Bgl II 

Nrul Pad 
Sfil . SnaBI 



AscI 
ESSHII 
Pmll 
Spel 



Aval 
BstEII 
RsrII 
Xhol 



Avrll 
Mlul 
SacII 
Xmal 



Cutters, 3 or fewer sites 

Aatll 2 1098 5925 

AflXXl 1 8173 

Agel 1 1436 

AlwNJ 3 2828 2852 7759 



ApaLI 
Asel 
Bgl I 
BsaAI 



3 6176 7859 8357 

3 591 5820 6672 

3 284 2717 6724 

2 7185 8421 
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Bsgl 


2 


2545 


4494 


Pvul 


1 


6476 




BsiWI 


2 


1568 


2301 


PvuII 


2 


1600 


4497 


BspDl 


2 


1723 


5793 


Sac I 


1 


216 




BspEX 


1 


3978 




Sail 


1 


3312 




BspMI 


1 


4576 




Seal 


2 


1360 


6365 


Bsfcll07I 


1 


8402 




Spill 


1 


4863 




BBtBI (AsuII) 


2 


945 


5207 


Sspl 


3 


2806 


6041 


BstXI 


3 


711 


2765 2896 


Sfcul 


1 


3395 




Bsu36X 


1 


2223 




Tfchllll 


1 


8426 












Xbal 


1 


2168 




Drain 


2 


3754 


7182 




















1 


711 





EagI 3 7 5711 8591 





Eamll05I 


2 


5077 


6843 




Ecll36I 


1 


216 




;: - 


Eco47III 


2 


1932 


4795 


ill 


EcoNI 


3 


3433 


4923 5293 




EcoRX 


1 


1383 






EcoRV 


2 


1885 


5658 




Esp3I (Bsal) 


2 


3120 


8524 


~r 








frl 


Espl (Bpull02I) 


1 


597 




iio 

ru 


Fspl 


2 


1960 


6623 




Hindi I I 


3 


885 


1717 1729 




Hpal 


2 


1017 


2272 




Kpnl 


2 


2323 


2934 




MscI 


2 


2204 


3789 


25 


NcoX 


1 


3766 






NdeX 


1 


8351 






NgoMl 


2 


4702 


7288 




Nhel 


2 


1929 


2875 




No tl 


3 


6 


5710 8590 


30 


Nsil 


2 


684 


1241 




PflMI 


2 


196 


1302 




PmeX 


1 


420 






PpuMI 


2 


142 


4339 




Psfcl 


1 


6602 
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Table 252: BstBI -Aa. tl I-EcoRl cassette for expression of EPI- 
HNE-4 

DNA has SEQ ID NO. 073; amino-acid sequence has SEQ ID NO. 
5 074 

I MRFPSIFT 
5 ' TTCaAA ACg ATg AaA T TC CC A TC T ATC TTC ACT 
BstBI | SsaBI | 



10 ! A V L F A 13 

gCT gTT TTg TTC gCT 





! A 
27 


S 


s 


A 


L 


A 


A 


P 


V 


N 


T 


T 


T 


E 




gCT 


TCC 


TCT 


gCT 


TTg 


gCT 


gCT 


CCA 


qTT 


AAC 


ACC 


ACT 


ACT 


gAA 




; 
! 












Bpml 


Hpal 








Bbsl 




! D 
41 


E 


T 


A 


Q 


I 


P 


A 


E 


A 


V 


I 


G 


Y 


:;|o 

ru 


qAC gAg 
! Bbsl 


ACT 


gCT 


CAA 


ATT 


CCT 


gCT 


gAg 


gCT 


gTC 


ATC 


ggT 


TAC 




! S 
55 


D 


L 


E 


G 


D 


F 


D 


V 


A 


V 


L 


P 


F 


Cl 
m 


TCT 


gAC 


TTg 


gAA 


ggT 


gAC 


TTC 


crAC crTC 
Aatll 


gCT 


gTT 


TTg 


CCA 


TTC 


Q 

m 


! S 
69 


N 


S 


T 


N 


N 


G 


L 


L 


F 


I 


N 


T 


T 


30 


TCT 

i 


AAC 


TCT 


ACT 


AAC 


AAC 


ggT 


TTg 


TTg 


TTC 


ATC 


AAC 


ACT 


ACC 




! I 
83 


A 


S 


I 


A 


A 


K 


E 


E 


G 


V 


S 


L 


D 


35 


ATC 

i 


gCT 


TCT 


ATC 


gCT 


gCT 


AAg 


gAg 


gAA 


ggT 


gTT 


TCC 


TTg 


gAC 


! K 
91 


R 


E 


A 


c 


N 


L 


p 
















AAg 

i 


AgA 


gAg 


gCT 


TgT 


AAC 


TTg 


CCA 














40 


! I 
105 


V 


R 


G 


P 


C 


I 


A 


F 


F 


P 


R 


W 


A 




ATC 


gTC 


AgA 


ggT 


CCA 


TgC 


ATT 


gCT 


TTC 


TTC 


CCA AaA 


Tgg 


gCT 










Nsil 










45 


! F 
119 


D 


A 


V 


K 


G 


K 


c 


V 


L 


F 


P 


Y 


G 




TTC 

! 
! 


gAC 


gCT 


gTT 


AAg 


ggT AAg 


TgC 


gTC 


TTg 


TTC 


CCA TAC 
PflMI 


ggT 


50 


! G 
133 


C 


Q 


G 


N 


G 


N 


K 


F 


Y 


S 


E 


K 


E 




crgT TgT 
! PflMI 


CAA 


ggT 


AAC 


ggT 


AAC 


AAg 


TTC 


TAC 


TCT 


gAg AAg 


gAg 



Table 251, continued 103 

ICREYCGVP. 
141 

TgT AgA gAg TAC TgT ggT gTT CCA TAg TAA crAATTC 
! EcoRI 

The DNA is a linear fragment that is double stranded in vivo, only one strand is shown. 

The amino acid sequence is that of a disulfide-containing protein that is processed in vivo. 
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Table 253: pD2pick (MFaPrePro : : EPI -HNE-3 ) , 8590 bp, CIRCULAR 
dsDNA, one strand shown. pD2pick (MFaPrePro : : EPI -HNE-3) DNA 
has SEQ ID NO. 075 Encoded protein has SEQ ID NO. 076 

1 2 3 4 5 

1234567890 1234567890 1234567890 1234567890 1234567890 

1 AgATCgCggC CgCgATCTAA CAT C C AAAgA CgAAAggTTg AATgAAACCT 

51 TTTTgCCATC CgACATCCAC AggTCCATTC T C AC AC AT AA gTgCCAAACg 

101 CAACAggAgg ggATACACTA gCAgCAgACC gTTgCAAACg CAggACCTCC 

151 ACTCCTCTTC TCCTCAACAC CCACTTTTgC CAT CgAAAAA CCAgCCCAgT 

2 01 TATTgggCTT gATTa aAaCT C aCTCATTCC AATTCCTTCT ATTAggCTAC 

Sad 

2 51 TAACACCATg ACTTTATTAg CCTgTCTATC CTggCCCCCC TggCgAggTC 

3 01 ATgTTTgTTT ATTTCCgAAT gCAACAAgCT CCgCATTACA CCCgAACATC 
351 ACTCCAgATg AgggCTTTCT gAgTgTgggg TCAAATAgTT TCATgTTCCC 

4 01 AAATggCCCA AAACTgAC Aq TTTAAAC aCT gTCTTggAAC CTAATATgAC 

Pmel 

4 51 AAAAgCgTgA TCTCATCCAA gATgAACTAA gTTTggTTCg TTgAAATgCT 
501 AACggCCAgT TggTCAAAAA gAAACTTCCA AAAgTCgCCA TACCgTTTgT 
551 CTTgTTTggT ATTgATTgAC gAATgCTCAA AAATAATCTC 
ATTAAT qCTTAqC 

Espl 

6 04 gCAgTCT CTCTATCgCT TCTgAACCCg gTggCACCTg TgCCgAAACg 

651 CAAATggggA AACAACCCgC TTTTTggATg ATTATgCATT gTCCTCCACA 
701 TTgTATgCTT CCAAaATTCT aa TaaaAATA CTgCTgATAg CCTAACgTTC 
XcmT 

751 ATgAT CAAAA TTTAACTgTT CTAACCCCTA CTTgACAggC AATATATAAA 
801 CAgAAggAAg CTgCCCTgTC TTAAACCTTT TTTTTTATCA TCATTATTAg 
851 CTTACTTTCA TAATTgCgAC TggTTCCAAT TgACAAgCTT TTgATTTTAA 
901 CgACTTTTAA CgACAACTTg AgAAgATCAA AAAACAACTA ATTATTCgAA 
! BstBI 
951 ACg 

i 

! MRFPSIFTAVLFA 
954 ATg AgA TTC CCA TCT ATC TTC ACT gCT gTT TTg TTC gCT 

! ASSALAAPVNTTT 



Table 253, continued 



105 



993 


gCT 


TCC 


TCT 


gCT 


TTg 


gCT 


gCT 


CCA 


gTT 


AAC 


ACC 


ACT 


ACT 


i 

1032 

i 


E 
gAA 


D 

gAC 


E 

gAg 


T 
ACT 


A 
gCT 


Q 

CAA 


i 

ATT 


P 
CCT 


A 
gCT 


E 
gAg 


A 
gCT 


V 

gTC 


I 

ATC 


1071 


G 
ggT 


Y 
TAC 


s 

TCT 


D 
gAC 


L 
TTg 


E 
gAA 


G 

ggT 


D 
gAC 


F 
TTC 


D V 
q-AC aTC 

Aatll 


A 

gCT 


V 
gTT 


1110 


L 
TTg 


P 
CCA 


F 
TTC 


S 
TCT 


N 
AAC 


S 
TCT 


T 
ACT 


N 
AAC 


N 
AAC 


G 
ggT 


L 
TTg 


L 
TTg 


F 
TTC 


1149 


I 

ATC 


N 
AAC 


T 
ACT 


T 
ACC 


I 

ATC 


A 

gCT 


s 

TCT 


I 

ATC 


A 

gCT 


A 
gCT 


K 
AAg 


E 
gAg 


E 
gAA 


1188 


G 
ggT 


V 
gTT 


S 
TCC 


L 
TTg 


D 
gAC 


K 
AAg 


R 
AgA 


A 

gCT 


A 

gCT 


C 
TgT 


N 
AAC 


L 
TTg 


P 
CCA 


1227 


I 

ATC 


V 
gTC 


R 
AgA 


G 
ggT 


P 

CCA 


C 
TgC 


I 

ATT 


A 
gCT 


F 
TTC 


F 
TTC 


P 
CCA 


R 
AgA 


W 


1266 


A 
gCT 


F 


D 


A 
gCT 


V 
gTT 


K 


G 
ggT 


K 


C 
TgC 


V 
gTC 


L 
TTg 


F 
TTC 


P 
CCA 


1305 


Y 
TAC 


G 
ggT 


G 

ggT 


C 
TgT 


Q 
CAA 


G 
ggT 


N 
AAC 


G 
ggT 


N 
AAC 


K 
AAg 


F 
TTC 


Y 
TAC 


S 
TCT 


1344 


E 

gAg 


K 
AAg 


E 

gAg 


C 
TgT 


R 
AgA 


E 

gAg 


Y 
TAC 


C 
TgT 


G 
ggT 


V 
gTT 


P 
CCA 


TAg 


TAA 



13 83 gAATTC gC CTTAgACATg 

! EcoRI 

14 01 ACTgTTCCTC AgTTCAAgTT gggCATTACg AgAAa ACCqg T CTTgCTAgA 

Aegl 

14 51 TTCTAATCAA gAggATgTCA gAATgCCATT TgCCTgAgAg ATgCAggCTT 

15 01 CATTTTTgAT ACTTTTTTAT TTgTAACCTA TATAgTATAg gATTTTTTTT 
1551 gT CATTTTgT TTCTTCTCgT ACgAgCTTgC TCCTgATCAg CCTATCTCgC 
1601 Ag CTgATgAA TATCTTgTgg TAggggTTTg ggAAAATCAT TCgAgTTTgA 
1651 TgTTTTTCTT ggTATTTCCC ACTCCTCTTC AgAgTACAgA AgATTAAgTg 

17 01 AgAAgTTCgT TTgTgCAAgC TTATCgATAA gCTTTAATgC ggTAgTTTAT 
1751 CACAgTTAAA TTgCTAACgC AgTCAggCAC CgTgTATgAA ATCTAACAAT 

18 01 gCgCTCATCg TCATCCTCgg CACCgTCACC CTggATgCTg TAggCATAgg 
1851 CTTggTTATg CCggTACTgC CgggCCTCTT gCgggATATC gTCCATTCCg 
1901 ACAg CAT Cg C CAgTCACTAT ggCgTgCTgC TAgCgCTATA TgCgTTgATg 

19 51 CAATTTCTAT gCgCACCCgT TCTCggAgCA CTgTCCgACC gCTTTggCCg 
2 0 01 CCgCCCAgTC CTgCTCgCTT CgCTACTTgg AgCCACTATC gACTACgCgA 
2 051 TCATggCgAC CACACCCgTC CTgTggATCT ATCgAATCTA AATgTAAgTT 
2101 AAAATCTCTA AATAATTAAA TAAgTCCCAg TTTCTCCATA CgAAC CTTAA 
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2151 CAgCATTgCg CfTcrAgC ATCT AqA CCTTCAA CAgCAgCCAg ATCCATCACT 
Xbal 

22 01 gCTTggCCAA TATgTTTCAg TCCCTCAggA gTTACgTCTT gTgAAgTgAT 

Bsu3 6I 

2251 gAACTTCTgg AAggTTgCAg TgTTAACTCC gCTgTATTgA CgggCATATC 

23 01 CgTACgTTgg CAAAgTgTgg TTggTACCgg AggAgTAATC TCCACAACTC 

23 51 T CTgg AgAgT AggCACCAAC AAACACAgAT CCAgCgTgTT gTACTTgATC 

24 01 AACATAAgAA gAAgCATTCT CgATTTgCAg gAT CAAgTgT TCAggAgCgT 

24 51 ACTgATTggA CATTTCCAAA gCCTgCTCgT AggTTgCAAC CgATAgggTT 

25 01 gTAgAgTgTg CAATACACTT g CgTAC AATT TCAACCCTTg gCAACTgCAC 
2 551 AgCTTggTTg TgAACAgCAT CTTCAATTCT ggCAAgCTCC TTgTCTgTCA 
2 601 TATCgACAgC CAACAgAATC ACCTgggAAT CAATAC CATg TTCAgCTTgA 
2651 gCAgAAggTC TgAggCAACg AAATCTggAT CAgCgTATTT ATCAgCAATA 
2 701 ACTAgAACTT CAgAAggCCC AgCAggCATg TCAATACTAC ACAgggCTgA 
2 751 TgTgTCATTT TgAACCATCA TCTTggCAgC AgTAACgAAC TggTTTCCTg 
28 01 gACCAAATAT TTTgTCACAC TTAggAACAg TTTCTgTTCC gTAAg C CAT A 
2 851 gCAgCTACTg CCTgggCgCC TCCTgCTAgC ACgATACACT TAgCACCAAC 
2 901 CTTgTgggCA ACgTAgATgA CTTCTggggT AAgggTACCA TCCTTCTTAg 

2 951 gTggAgATgC AAAAACAATT TCTTTgCAAC CAgCAACTTT ggCAggAACA 

3 001 CCCAgCATCA gggAAgTggA AggCAgAATT gCggTTCCAC CAggAATATA 
3 051 gAggCCAACT TTCTCAATAg gTCTTgCAAA ACgAgAgCAg ACTACACCAg 
3101 ggCAAgTCTC AACTTg CAAC gTCTCCgTTA gTTgAgCTTC ATggAATTTC 
3151 CTgACgTTAT CTATAgAgAg ATCAATggCT CTCTTAACgT TATCTggCAA 
32 01 TTgCATAAgT TCCTCTgggA AAggAgCTTC TAACACAggT gTCTTCAAAg 
3251 CgACTCCATC AAACTTggCA gTTAgTTCTA AAAgggCTTT gTCACCATTT 
3 3 01 TgACgAACAT TgTCgACAAT TggTTTgACT AATT C CATAA TCTgTTCCgT 
3351 TTTCTggATA ggACgACgAA gggCATCTTC AATTTCTTgT gAggAggCCT 

Still 

3401 TAgAAACgTC AATTTTgCAC AATT CAATAC gACCTTCAgA AgggACTTCT 
34 51 TTAggTTTgg ATTCTTCTTT AggTTgTTCC TTggTgTATC CTggCTTggC 
3 501 ATCTCCTTTC CTTCTAgTgA CCTTTAgggA CTTCATATCC AggTTTCTCT 
3 551 CCACCTCgTC CAACgTCACA CCgTACTTgg CACATCTAAC TAATgCAAAA 
3 601 TAAAATAAgT CAgCACATTC CCAggCTATA TCTTCCTTgg ATTTAgCTTC 
3651 TgCAAgTTCA TCAgCTTCCT CCCTAATTTT AgCgTTCAAC AAAACTTCgT 
3 701 CgT CAAATAA CCgTTTggTA TAAgAACCTT CTggAgCATT gCTCTTACgA 
3 751 T C C CACAAgg TgCTTCCATg_gCTCTAAgAC CCTTTgATTg gCCAAAACAg 
Ncol 
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3801 


gAAgTgCgTT 


CCAAgTgACA 




3851 


TTTCAAgCAg 


TCTCCATCAC 




3901 


TTCgTCCAgA 


TgTAgCACCT 


■; 5 


3951 


TAgACTCCAg 


TTTgTgTCCT 




4001 




P 1 c n a a rv-r »«t 
L. L. UAAUgAg 1 




4 051 




ggggcggTCA 






ACAgggAACT 


TTTTgACATC 




4151 


C Crr a <~r P 1 A T P 1 a 


ATAATggggA 


10 


4201 


pta ppsj ptt 

V- X.f-iL.L..firlV_, X X 


TgCggTCTCA 




4251 


x x iig x giifLH.v_ 


TTTTCAAATC 




4301 


i-i>— Xjhigv^AX X/i 


gCgggCAAgg 






Ta ar" , P 1 P 1r na/->rP 1 

X.fLrlV_V_V_ lACjL 


gCCTgggATC 


«i 


4401 


anaTrra a a a 
x^y y x v_v_.tt^i/ift. 


TP 1 a PTTPATT 

1 At_ X 1 L. A X 1 


%|_ 5 

=1.1 




v—h. x x x x g 


AAgCTCAgTC 


ni 




TggTCAgCAg 


CATAgggAAA 






ATCAAACTCT 


gCAACACTTg 


Q 
h= 


4601 


TTgAAgTCgg 


ACAgTgAgTg 






TATTAT CAgT 


gAgTCAgTCA 


1 ; 2 0 




gg C Cgg CAT C 


ACCgg Cg C CA 


rii 




auai A.v_v_g.A. 


TggggAAgAT 






TgTTTCggCg 


TgggTATggT 






HrtCC a TCTTT' 
LyLv-Hl V- X 


TTgCATgCAC 




4901 


TPaappTSPT 


al x gggc x gc 


25 




CgTCgAgTAT 


CTATgATTgg 






CAgTgTCTTg 


AggTCTCCTA 




5051 


gAggAgAT T T 


CATggTAAAT 




5101 


A A P'Trr'Trr a rv a 

ririv^ x y x yi-Ly i-i. 


^ x ri. x i„ x gg x 






AgTAAATgAA 


gTCCCACCAA 


30 


52 01 


xv—x xyx x x 


PWa A P»T"T" I n | n i I 

g/iA.i_ x x x x x 




5251 


ATATgTCggg 


TAggAATgg A 




5301 


a rr a rrrrT a T'rrT 1 

.«.y.ri.yy xrixy x 


AgggTTTgTA 






gCTATTTCgT 


TCAAACCATT 






TACTATTgAT 


CCAAgCCAgT 


35 


5451 


TgTTTTgAgg 


TCATCTTTgT 




5501 


TCTTgACgAg 


CCAAggCgAT 




5551 


gTTAAAAggA 


CAAgTATgTC 




5601 


gTCTgATCCT 


CATCAACTTg 



gAAACCAACA CCTgTTTgTT CAACCACAAA 
AATCCAATTC g AT AC C C Ag C AACTTTTgAg 
TTATACCACA AACCgTgACg ACgAgATTgg 
TATAgCCTCC_ggAATAgACT TTTTggACgA 
BspEI 

AATTAgAAgA gTCAgCCACC AAAgTAgTgA 
gTAgTCAAAg ACgCCAACAA AATTTCACTg 
TTCAgAAAgT TCgTATTCAg TAgTCAATTg 
TTATACCAgA AgCAACAgTg gAAgTCACAT 
gAAAAAgCAT AAACAgTTCT ACTACCgCCA 
gCCCAgTggA gAAgAAAAAg gCACAgCgAT 
ATgCAACTTT ATCAACCAgg gTCCTATAgA 
ATCCTTTggA CAACTCTTTC TgCCAAATCT 
gATACCATTA TACggATgAC TCAACTTgCA 
gATTgAgTgA ACTTgATCAg gTTgTgCAgC 
CACggCTTTT CCTACCAAAC TCAAggAATT 
CgTATgCAgg TAg CAAgggA AATgTCATAC 
TAgTCTTgAg AAATTCTgAA gCCgTATTTT 
TCAggAgATC CTCTACgCCg gACgCATCgT 
CAggTgCggT TgCTggCgCC TATATCgCCg 
CgggCTCgCC ACTTCgggCT CATgAgCgCT 
ggCAggCCCC gTggCCgggg gACTgTTggg 
CATTCCTTgC ggCggCggTg CTCAACggCC 
TTCCTAATgC AggAgTCgCA TAAgggAgAg 
AAgTATgggA ATggTgATAC CCgCATTCTT 
TCAgATTATg CCCAACTAAA gCAACCggAg 
TTCTCTgACT TTTggTCATC AgTAgACTCg 
TATgACAg C A gAAATgTCCT TCTTggAgAC 
TAAAg AAAT C CTTgTTATCA ggAACAAACT 
CggTgCCTTg AACTATAAAA TgTAgAgTgg 
gCgggCAAAT gCTTACCTTC TggACCTTCA 
gATACTgATg CCAACTTCAg TgACAACgTT 
CCgAATCCAg AgAAAT C AAA gTTgTTTgTC 
gCggTCTTgA AACTgACAAT AgTgTgCTCg 
ATgAATAAAT CTAgTCTTTg ATCTAAATAA 
AAATACCCAA ATCTAAAACT CTTTTAAAAC 
TgCCTgTATT AAACCCCAAA TCAgCTCgTA 
AggggCACTA TCTTgTTTTA gAgAAATTTg 
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5651 CggAgATgCg ATATCgAgAA 
5701 ATCTCAAgAT CgCggCCgCg 

5 751 TTCCCgATCT gCgTCTATTT 
5801 gATAAgCTgT CAAACATgAg 

,5 5851 AgAAATCTTg AAgACgAAAg 
5901 AATgTCATgA TAATAATggT 
5951 ggggAAATgT gCgCggAACC 
6001 AATATgTATC CgCTCATgAg 
6051 TTgAAAAAgg AAgAgTATgA 
10 6101 CCTTTTTTgC ggCATTTTgC 

6151 gTgAAAgTAA AAgATg CTg A 
6201 CgAACTggAT CTCAACAgCg 
62 51 AACgTTTTCC AATgATgAgC 
W 63 01 TTATCCCgTg TTgACgCCgg 

$.5 6351 TTCTCAgAAT gACTTggTTg 
fy 64 01 CggATggCAT gACAgTAAgA 

=W 64 51 gATAACACTg CggCCAACTT 

p 65 01 gCTAACCgCT TTTTTgCACA 

6551 gTTgggAACC ggAgCTgAAT 
?%Q 66 01 ACgATgCCTg CAgCAATggC 

6 651 ACTACTTACT CTAgCTTCCC 
*~ 6 701 ATAAAgTTgC AggACCACTT 

6751 ATTgCTgATA AATCTggAgC 
6801 AgCACTgggg CCAgATggTA 
25 6851 CggggAgTCA ggCAACTATg 
6901 ggTgCCTCAC TgATTAAgCA 
6951 TATACTTTAg ATTgATTTAA 
70 01 gCgTTAAATT TTTgTTAAAT 

7 051 CggCAAAATC CCTTATAAAT 
3 0 7101 TTgTTCCAgT TTggAACAAg 

7151 gTCAAAgggC gAAAAACCgT 

72 01 AT C AC C C T AA TCAAgTTTTT 
7251 ggAAC C CTAA AgggAgCCCC 

73 01 AACgTggCgA gAAAggAAgg 
35 73 51 gCTggCAAgT gTAgCggTCA 

74 01 TTAATgCgCC gCTACAgggC 
7451 TgATAATCTC ATgACCAAAA 
7501 CgTCAgACCC CgTAgAAAAg 



AAAggTACgC TgATTTTAAA CgTgAAATTT 
ATCTCgAATA ATAACTgTTA TTTTTCAgTg 
CACAATACCA ACATgAgTCA gCTTATCgAT 
AATTAATTCg ATgATAAgCT gTCAAACATg 
ggCCTCgTgA TACgCCTATT TTTATAggTT 
TTCTTAgACg TACgT CAggT ggCACTTTTC 
CCTATTTgTT TATTTTTCTA AATACATTCA 
ACAATAACCC TgATAAATgC TTCAATAATA 
gTATT CAACA TTTCCgTgTC gCCCTTATTC 
CTTCCTgTTT TTgCTCACCC AgAAACgCTg 
AgATCAgTTg ggTgCACgAg TgggTTACAT 
gTAAgATCCT TgAgAgTTTT CgCCCCgAAg 
ACTTTTAAAg TTCTgCTATg TggCgCggTA 
gCAAgAgCAA CTCggTCgCC gCATACACTA 
AgTACTCACC AgTCACAgAA AAgCATCTTA 
gAATTATgCA gTgCTgCCAT AAC CATgAgT 
ACTTCTgACA ACgATCggAg gACCgAAggA 
ACATgggggA TCATgTAACT CgCCTTgATC 
gAAgCCATAC CAAACgACgA gCgTgACACC 
AACAACgTTg CgCAAACTAT TAACTggCgA 
ggCAACAATT AATAgACTgg ATggAggCgg 
CTgCgCTCgg CCCTTCCggC TggCTggTTT 
CggTgAgCgT gggTCTCgCg gTATCATTgC 
AgCCCTCCCg TATCgTAgTT AT CTACACgA 
gATgAACgAA ATAgACAgAT CgCTgAgATA 
TTggTAACTg TCAgACCAAg TTTACTCATA 
ATTgTAAACg TTAATATTTT gTTAAAATTC 
CAgCTCATTT TTTAACCAAT AggCCgAAAT 
CAAAAgAATA gACCgAgATA gggTTgAgTg 
AgTCCACTAT TAAAgAACgT ggACTCCAAC 
CTATCAgggC gATggCCCAC TACgTgAACC 
TggggTCgAg gTgCCgTAAA gCACTAAATC 
CgATTTAgAg CTTgACgggg AAAgCCggCg 
gAAgAAAgCg AAAggAgCgg gCgCTAgggC 
CgCTgCgCgT AACCACCACA CCCgCCgCgC 
gCgTAAAAgg ATCTAggTgA AgATCCTTTT 
TCCCTTAACg TgAgTTTTCg TTCCACTgAg 
AT C AAAggAT CTTCTTgAgA TCCTTTTTTT 
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7551 








7601 








7651 


gCTTCAgCAg 






7701 


TTAggCCACC 




5 


7751 


afTAATPrTn 






7801 


^^yyy y y 






7851 


TgAACgggg g 






7901 


CgAACTgAgA 








AAgggAgAAA 


10 


8 0 01 


gAgCgCACgA 






8051 


TgT CgggTTT 








CA 99"y"9'y"y"Cg 


Q 




8151 


TTPPTrn-rPrT 

i. iLLiygLUJ. 


i 






rprp/-irp 


| 


5 




CTCgCCgCAg 


i 






gAAgAgCgCC 








ACACCgCATA 


- , 




8401 


TAAgCCAgTA 






8451 


CCCgACACCC 


| 


0 


8501 


CCggCATCCg 


P 




8551 


TCAgAggTTT 



TCTgCTgCTT gCAAACAAAA 
CCggATCAAg AgCTACCAAC 
AgCgCAgATA CCAAATACTg 
ACTT CAAgAA CTCTgTAgCA 
TTACCAgTgg CTgCTgCCAg 
CTCAAgACgA TAgTTACCgg 
gTTCgTgCAC ACAgCCCAgC 
TACCTACAgC gTgAgCATTg 
ggCggACAgg TAT C CggTAA 
gggAgCTTCC AgggggAAAC 
CgCCACCTCT gACTTgAgCg 
gAgCCTATgg AAAAACgCCA 
TTTgCTggCC TTTTgCTCAC 
gTggATAACC gTATTACCgC 
CCgAACgACC gAgCgCAgCg 
TgATgCggTA TTTTCTCCTT 
TggTgCACTC TCAgTACAAT 
TACACTCCgC TATCgCTACg 
gCCAACACCC gCTgACgCgC 
CTTACAgACA AgCTgTgACC 
TCACCgTCAT CAC CgAAACg 



AAACCACCgC TACCAgCggT 
TCTTTTTCCg AAggTAACTg 
TCCTTCTAgT gTAgCCgTAg 
CCgCCTACAT ACCTCgCTCT 
TggCgATAAg TCgTgTCTTA 
ATAAggCgCA gCggTCgggC 
TTggAgCgAA CgACCTACAC 
AgAAAgCgCC ACgCTTCCCg 
gCggCAgggT CggAACAggA 
gCCTggTATC TTTATAgTCC 
TCgATTTTTg TgATgCTCgT 
gCAACgCggC CTTTTTACgg 
ATgTTCTTTC CTgCgTTATC 
CTTTgAgTgA gCTgATACCg 
AgT CAgTgAg CgAggAAgCg 
ACgCATCTgT gCggTATTTC 
CTgCTCTgAT gCCgCATAgT 
TgACTgggTC ATggCTgCgC 
CCTgACgggC TTgTCTgCTC 
gTCTCCgggA gCTgCATgTg 
CgCgAggCAg 
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Table 254: restriction map of pD2pick (MFaPrePro : : EPI-HNE-3 ) 



Non-cutters 

A flu Apal AscI Aval Avrll 

5 BanHI Bglll BssHII BstE.ll Mlul 

Pad Pmll Rsrll Sacll Sfil 

SnaBI Spel Xhol Xmal 



Cutters, 3 or fewer sites 



10 


Aatll 


1 


1098 






EcoRV 


2 


1885 


5660 




AflHI 


1 


8179 






Esp3I (Bsal) 


2 


3120 


8530 


..; 


Age I 


1 


1436 






Bspl (Bpull02I) 


1 


597 






AlwSl 


3 


2828 


2852 


7765 


Fspl 


2 


1960 


6629 




ApaLI 


3 


6182 


7865 


8363 


Hindi I I 


3 


885 


1717 


fO.5 


Asel 


3 


591 


5822 


6678 


Bpal 


2 


1017 


2272 




Bgll . 


3 


284 


2717 


6730 


Kpnl 


2 


2323 


2934 


ill 


BsaAI 


2 


7191 


8427 




Mscl 


2 


2204 


3789 


O 


Bsgl 


2 


2545 


4494 




Ncol 


1 


3766 




m 


BsiWI 


3 


1568 


2301 


5929 


Ndel 


1 


8357 




mo 


BspDI 


2 


1723 


5795 




NgoMl 


2 


4702 


7294 




BspEI 


1 


3978 






Nhel 


2 


1929 


2875 




BspMI 


1 


4576 






Notl 


3 


6 


5712 




Bstll07I 


1 


8408 






Nrul 


1 


5208 






BstBl (AsuII) 


1 


945 






Nsil 


2 


684 


1241 


25 


BstXl 


3 


711 


2765 


2896 


PflMI 


2 


196 


1302 




Bsu3 6I 


1 


2223 






Pmel 


1 


420 






Dralll 


2 


3754 


7188 




PpuMl 


2 


142 


4339 




Eagl 


3 


7 


5713 


8597 


Ps tl 


1 


6608 






Barall05I 


2 


5077 


6849 




Pvul 


1 


6482 




30 


Ecll361 


1 


216 






PvuII 


2 


1600 


4497 




Sco47III 


2 


1932 


4795 




Sad 


1 


216 






EcoNI 


3 


3433 


4923 


5295 


Sail 


1 


3312 






EcoRI 


1 


1383 






Seal 


2 


1360 


6371 



3 phi 
Sspl 

StuI 

rtMin 

Xbal 
Xcml 



1 4863 

3 2806 6047 6983 

1 3395 

1 8432 

1 2168 

1 711 
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Table 400: Amino-acid Sequence of ITI light chain (SEQ ID 
NO. 077) 

111111 111122 
12345 6789012345 678901 
avlpq eeegsgggql vtevtk 



2222222233333333334444444444555555555566666666667777777 
2345678901234567890123456789012345678901234567890123456 

KEDSCQLGYSAGPCMGMTSRYFYNGTSMACETFQYGGCMGNGNNFVTEKECLQTC 

1 — 1 , 1 1 1 1 , 1 y 

77788 
78901 
rtvaa 



lllllllllliiiililiilllllllllllllllll 
88888888 99 99 99999 9000000000011111111112222222222333333 
2345678901234567890123456 78901234567890123456789012345 
CNLPIVRGPCRAFIQLWAFDAVKGKCVLFPYGGCQGNGNKFYSEKECREYCGVP 



111111111111 
333344444444 
678901234567 
gdgdeellrf sn 



ITI-D1 comprises residues 22-76 and optionally one of 
residue 77, residues 77 and 78, or residues 77-79. 
ITI-D2 comprises residues 80-135 and optionally one of 
residue 79 or residues 78-79. 



The lines under the sequences represent disulfides. 
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TABLE 602: Physical properties of hNE inhibitors derived from Kunitz domains 



Protein 


Parent 


u 
tt 

Resid 
ues 


MOI wt 


Pre- 
dicted pi 


K D 

(pM) 


(10 6 / 
M/s) 


(lO 6 / 
s) 


EPI-HNE-l 


BPTI 


58 


6359 


9 . 10 


2 . 0 


3.7 


7 . 4 


EPI-HNE-2 


BPTI 


62 


6759 


4.89 


4.9 


4 . 0 


20. 


EPI-HNE-3 


ITI-D2 


56 


6179 


10 . 04 


6.2 


8 . 0 


50 . 


EPI-HNE-4 


ITI-D2 


56 


6237 


9 . 73 


4.6 


10.6 


49 . 



The constants K D and k on above were measured with [hNE] = 8.47 x 10" 10 molar; 
10 k off was calculated from = K D x k on . 

J TABLE 603: SUMMARY OF PURIFICATION OF EPI-HNE-2 



-15 


STAGE 


Volume 
(ml) 


Concentratio 
n (mg/ml) 


Total 
(mg) 


Activity 
(mg/A 280 ) 


Pi 


HARVEST 


3,300 


0.70 


2.31 


< 0.01 


go 


30K ULTRA- 
FILTRATION 
FILTRATE 


5,000 


0.27 


1.40 


< 0.01 




5K ULTRA- 
FILTRATION 
RETENTATE 


1,000 


1.20 


1.20 


0.63 


25 


AMMONIUM 

SULFATE 

PRECIPITATE 


300 


2.42 


0.73 


1.05 




IEX pH6.2 
ELUATE 


98 


6.88 


0.67 


1.03 


30 


EPI-HNE-3, 
LOT 1 


50 


13.5 


0.68 


1.04 
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TABLE 604: SUMMARY OF PURIFICATION OF EPI-HNE-3 



STAGE 


VOLUME 
(ml) 


CONCENTRATIO 
N 

(mg/ml) 


TOTAL 

(mg) 


ACTIVIT 
Y 

(mg/A 280 ) 


HARVEST 


3,100 


0.085 


263 


nd 


30K ULTRA- 
FILTRATION 
FILTRATE 


3,260 


0.055 


179 


0.007 


FIRST IEX: 

pH6.2 

ELUATE 


180 


0.52 


94 


0.59 


AMMONIUM 

SULFATE 

PRECIPITATE 


100 


0.75 


75 


0.59 


IEX pH9 
ELUATE 


60 


1.01 


60 


0.59 


EPI-HNE-3 , 
LOT 1 


26 


1.54 


40 


0.45 
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TABLE 605: K, VALUES OF EPI-HNE PROTEINS FOR VARIOUS HUMAN 
SERUM SERINE PROTEASES 





Inhibitor: 








Enzyme 


EPI-HNE-1 


EPI-HNE-2 


EPI-HNE-3 


EPI-HNE-4 


Human Neutrophil 
Elastase 


2 pM 


5 pM 


6 pM 


5 pM 


Human Serum Plasmin 


> 6*iM 


> 100 |iM 


> 100 \M 


>90 (iM 


Human Serum Kallikrein 


>10 jiM 


> 100 jiM 


> 100 iiM 


>90 ^M 


Human Serum Thrombin 


>90 \iM 


> 100 ^M 


> 100 iiM 


>90fiM 


Human Urine Urokinase 


>90nM 


> 100 nM 


> 100 jiM 


>90 fiM 


Human Plasma Factor X a 


>90 nM 


> 100 fiM 


> 100 jiM 


>90 iiM 


Human Pancreatic 
Chymotrypsin 


-10 [iM 


-10 nM 


-30 jiM 


-IOjiM 



Table 607: PEY-33 which produces EPI-HNE-2 



Elapse Fermenter Time 
Hours:minutes 


Cell Density 


Activity in supernatent 
(mg/1) 


41:09 


89 


28 


43:08 


89 


57 


51:54 


95 


92 


57:05 


120 


140 


62:43 


140 


245 


74:45 


160 


360 


87:56 


170 


473 


98:13 


190 


656 


102:25 


200 


678 


109:58 


230 


710 



Fermenter culture growth and EPI-HNE protein secretion by P. pastoris strains PEY-33. 
Time course is shown for fermenter cultures following initiation of methanol-limited 
feed growth phase. Increase in cell mass is estimated by A^. Concentration of 
inhibitor protein in the fermenter culture medium was determined from measurements of 
hNE inhibition by diluted aliquots of cell-free CM obtained at the times indicated and 
stored at -20°C until assay. 
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Table 608: PEY-43 Which produces EPI-HNE-3 



Elapse Fermenter Time 
Hours . minutes 


Cell Density 
(A^) 


Activity in supernatent 
(mg/1) 


44:30 


107 


0.63 


50:24 


70 


9.4 


52:00 


117 


14. 


62:00 


131 


28. 


76:00 


147 


39. 


86:34 


200 


56. 


100:27 


185 


70. 


113:06 


207 


85. 



Fermenter culture growth and EPI-HNE protein secretion by P. pastoris strains PEY-43. 
Time course is shown for fermenter cultures following initiation of methanol-limited 
feed growth phase. Increase in cell mass is estimated by A^. Concentration of 
inhibitor protein in the fermenter CM was determined by assays of hNE inhibition by 
diluted aliquots of cell-free CM obtained at the times indicated and stored at -20°C until 
assay. 
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Table 610: Inhibitory properties of EPI-HNE-2 



lil of EPI-HNE-2 solution 
added 


Percent residual hNE 
activity 


0. 


101 1 


0. 


100 0 


0. 


100 0 


0. 




0. 




0. 


~5T£ 


10. 




20 


71.8 


30. 


59.5 


40 


46.2 


50 


39.2 


55 


32.2 


60 


22.5 


65 


23.5 


70 


15.0 


75 


10.4 


80. 


8.6 


85 


4.8 


90. 


1.4 


95. 


2.0 


100. 


2.5 


120. 


0.2 


150. 


0.2 


200. 


0.04 
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Table 611: hNE inhibitory properties of EPI-HNE-3 





Hi of EPI-HNE-3 
solution added 


Percent residual 
hNE activity 




0. 


101.2 


5 


0. 


100.0 




0. 


100.0 




0. 


100.0 




0. 


100.0 




0. 


98.8 


10 


10. 


81.6 




20. 


66.9 


Q 


30. 


53.4 




40. 


38.0 




50. 


27.6 


np-5 


55. 


21.5 




60. 


13.0 


P 


65. 


11.0 


o 


70. 


7.9 


m 
O 


75. 


3.8 


mo 


80. 


3.3 




85. 


2.1 




90. 


1.8 




100. 


1.6 




110. 


0.8 


25 


120. 


0.7 




160. 


0.6 




200. 


0.2 
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Table 612: pH stability of Kunitz-domain hNE inhibitors 







Percent Residual hNE Inhibitory Activity 




Incubation pH 


Erl-JtlJNli-1 


T?T)T TJXTTT O 

crl-rliNr,-2 


EPI-HNE-3 


EPI-HNE-4 




1.0 


102 


98 


97 


98 




2.0 


100 


97 


97 


100 


5 


2.6 


101 










3.0 


100 


101 


100 


96 




4.0 


98 


101 


102 


94 




5.0 


100 










5.5 




99 


99 


109 




6.0 


100 




103 


99 




6.5 






99 


100 


U.i 


7.0 


93 


103 


103 


93 




7.5 






87 


109 


m 


8.0 


96 




84 


83 


If 5 


8.5 




104 


68 


86 




9.4 


100 




44 


40 




10.0 


98 


102 


27 


34 



Proteins were incubated at 37°C for 18 hours in buffers of defined pH (see text). In all 
20 cases protein concentrations were 1 \lM. At the end of the incubation period, aliquots of 
the reactions were diluted and residual hNE-inhibition activity determined. 
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Table 620: Stability of hNE inhibitory proteins to oxidation by Chloramine-T 





Table 620 


Percent Residual hNE-Inhibitory Activity 


5 


Molar Ratio 
CHL-T: 
Inhibitor 


EPI- 
HNE-1 


EPI- 
HNE-2 


EPI- 
HNE-3 


EPI- 
HNE-4 


od anti 
trypsin 


SLPI 




0 


100 


100 


100 


100 


100 


100 




0.25 




94 












0.29 












93 




0.30 










97 




10 


.48 


102 














.50 




102 


97 


100 


85 






.59 












82 




.88 












73 


irri 


.95 


100 












HiL5 


1.0 




102 


97 


100 


41 




rii 


1.2 












65 


r\ 


1.4 


98 














1.5 




95 












1.9 


102 












So 

SU 


2.0 




102 










2.1 










7 






2.4 












48 




3.0 






97 


100 








3.8 


94 












25 


4.0 




95 












5.0 






94 


100 








5.2 










7 






5.9 












18 




9.5 


95 












30 


10. 




98 


97 


104 








10.4 










>5 






12. 












15 




19. 


92 














30. 






100 


100 







Table 620 


Percent Residual hNE-Inhibitory Activity 


Molar Ratio 
CHL-T: 
Inhibitor 


EPI- 
HNE-1 


EPI- 
HNE-2 


EPI- 
HNE-3 


EPI- 
HNE-4 


cd anti 
trypsin 


SLPI 


50. 






94 


100 







Inhibitors were incubated in the presence of Chloramine-T at the molar ratios indicated 
for 20 minutes at RT. Oxidation reactions were quenched by adding methionine to a 
final concentration of 4 mM. Residual hNE-inhibition activity remaining in the 
quenched reactions is shown as a percentage of the activity observed with no added 
oxidant. Proteins and concentrations in the oxidation reactions are: EPI-HNE-1, (5 
liM); EPI-HNE-2, (10 \iM); EPI-HNE-3,(10 ^M); EPI-HNE-4, (10 ^M); API, (10 |xM 
); and SLPI, (8.5 \lM). 



Table 630: Temperature stability of EPI-HNE proteins 





Residual hNE Inhibitory Activity 


Temperature 
(°Q 


EPI-HNE-1 


EPI-HNE-2 


EPI-HNE-3 


EPI-HNE-4 


0 


97 


101 


96 


100 


23 


100 


103 


105 


103 


37 


100 


97 


99 


98 


45 


103 








52 




101 


100 




55 


99 






98 


65 


94 


95 


87 




69 








82 


75 


100 








80 




101 


79 




85 


106 






63 


93 




88 


57 




95 


64 






48 



Proteins were incubated at the stated temperature for 18 hours in buffer at pH 7.0. In all 
cases protein concentrations were 1 |iM. At the end of the incubation period, aliquots of 
the reactions were diluted and residual hNE-inhibition activity determined. 



Table 711: Mutations that are likely to improve the affinity of a Kunitz domain for hNE 
Most Preferred 

XI 8F; 

[X15I (preferred) , X15V] ; 
Highly Preferred 

[X16A (Preferred) , X16G] ; 

[X17F (preferred) , X17M, X17L, X17I, X17L] ; 

[{X19P, X19S} (equally preferred), X19K, X19Q] ; 
X3 7G; 
X12G; 

Preferred 
X13P; 
X20R; 

X21Y; X21W; 

[X34V (preferred) , X34P] ; 
[X3 9Q, X3 9M] ; 
[X32T, X32L] ; 
[X31Q, X31E, X31V] ; 
[X11T, X11A, X11R] i 
[X10Y, X10S, X10V] ; 
[X4 0G, X4 0A] ; 
X3 6G; 
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Table 720: M13_III_signal : :Human_LACI-D2 : :mature_M13_III 

DNA has SEQ ID NO. 078, amino -acid sequence has SEQ ID NO. 

079. DNA is linear and in vivo it is double stranded. 

Amino-acid sequence is of a protein that is processed in 

vivo by cleavage after Ala_ 1( - the entire gene encodes an 

amino-acid sequence that continues to give a functional M13 

III protein. 

M K K L L F 
-18 -17 -16 -15 -14 -13 
| atg | aaG I aaG | ctt I etc | ttc | 



Hindi I I 



AI PLVVPFYSGA 
-12 -11 -10 -9 -8 -7 -6 -5 -4 -3 -2 -1 
| gec | att | cct | ctg | gtg | gta | cct | ttc | tat | tec | ggc | gec | 

I BstXI I 1 Kpnl 1 J KasI \ 

J Xcml I 

KPDFCFLEEDPG 
123456789 10 11 12 
| aag | cct | gac | ttc | tgc | ttc I etc | gag I gag | gat [ ccc | ggg I 
1 Xhol 1 1 Xmal | 

ICRGYITRYF 
13 14 15 16 17 18 19 20 21 22 
| att | tgc | cgc | ggt | tat | att I acg | cgt I tat | ttc | 
I Sacll \ 1 Mlul | 

YNNQTKQCER 
23 24 25 26 27 28 29 30 31 32 
| tat j aat | aac | cag | act | aag | caa | tgt I gag | egg I 
| BsrDI | j BsrI \ 

FKYGGCLGNM 
33 34 35 36 37 38 39 40 41 42 
| ttc | aag | tat j ggt | ggt | tgc | eta | ggt | aat | atg | 
1 Avrllj 

NNFETLEECK 
43 44 45 46 47 48 49 50 51 52 

| aac | aac | ttc | gag | act | eta | gaa | gag | tgt | aag | 
I Xbal 1 

NICEDGGAETVES 
53 54 55 56 57 58 100 101 102 103 104 105 106 
| aac | ata | tgt | gag | gat | ggt | ggt | get | gag | act | gtt | gag | tct | 
I Ndel [ | DrdI |_ 

iioi is the first residue of mature M13 III. 
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Table 725: Synthetic laci-dl with sites for cloning into 
display vector 

DNA has SEQ ID NO. 08 0, amino -acid sequence has SEQ ID NO. 



AAEMHS FCAFKAD 
123456789 10 

3 » -gcg i gcc | gag atg | cat I tec | ttc | tgc | get | ttc | aaa | get | gat | 
\Eagl |_ 1 Nail |_ 

DGPCKAIMKR 
11 12 13 14 15 16 17 18 19 20 
| gaC | ggT | ccG | tgt | aaa | get | ate | atg | aaa | cgt I 
I RsrlT | [ BgpHl| 

FFFNIFTRQC 
21 22 23 24 25 26 27 28 29 30 
| ttc | ttc | ttc | aac | att | ttc I acG | cgt I cag | tgc | 
1 Mlul 1 



EEFIYGGCEGNQ 
31 32 33 34 35 36 37 38 39 40 41 42 
| gag I gaA | ttC I att | tac | ggt | ggt | tgt | gaa I ggt | aac | cag | 



BstEII 1 



NRFESLEE 
43 44 45 46 47 48 49 50 
| aac | cgG I ttc | gaa I tct | ctA | gag j gaa | 



Aqel 



BstBI | | Xbal | 



CKKMCTRDGA 
51 52 53 54 55 56 57 58 59 101 
| tgt | aag | aag | atg | tgc | act | cgt | gac I ggc gcc 



KasI 



Ala 101 is the first residue of mature M13 III. 
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Table 730: LACI-D1 hNE Library 

DNA has SEQ ID NO. 0 82, amino-acid sequence has SEQ ID MO 
083 

AAEMHSFCAFKA 
123456789 
5 ' -gcg | gcc | gag I atg | cat I tec | ttc | tgc | get | ttc | aaa | get | 



\Eacrl 

T N 
R K R 
G S A 
H E G 
N D G 
11 12 



Nsil 



H R 
PL 



C V|I A | G I|V 
14 15 16 17 



s 




T 


N 




I 


M 




Q 


H 




L 


P 




K 


R 


R 


19 


20 



| NRt | RVS | ggT j cNt | tgt | Rtt | gSt | Ntc | ttc | MNS | cgt | 

C 
Y|W 

F|L ffniftrqc 

21 22 23 24 25 26 27 28 29 30 
| tDS | ttc | ttc | aac | att | ttc I acG | cgt I cag | tgc j 
[ Mlul j 



L 
T 

L I Q V 
E|V E 

31 32 



Y 
35 



C 
38 



P 
K 
M 

A G | A 

40 



EG 

q|r 

42 



| SWG | VHA | ttC | VHA | tac | ggt | ggt | tgt | VHG | gSt | aac | SRG | 



43 44 45 46 47 48 49 50 

| aac I cgG I ttc | gaa I tct | ctA | gag | gaa | 
! I BstBI I 1 XbaT | 

1 Agrel | 

CKKMCTRDGA 
51 52 53 54 55 56 57 58 59 101 
| tgt | aag | aag | atg | tgc | act | cgt | gac I ggc gcc 

1 KasI | 



Variegation at 10, 11, 13, 15, 16, 17, 19, and 20 gives rise to 253,400 amino-acid 
sequences and 589,824 DNA sequences. Variegation at 31, 32, 34, 39, 40, and 42 gives 
23,328 amino-acid and DNA sequences. There are about 5.9 x 10 9 protein sequences 
and 1.4 x 10 10 DNA sequences. 
Ala 101 would be the first residue of mature M13 III. 



Table 735: LAC I - D2 hNE Library 

DNA has SEQ ID NO. 084; amino -acid sequence has SEQ ID NO. 







P 


H 








T 


N 




c 


R 


K 


R 




s 


G 


S 


A 




Y 


H 


E 


G 




D 


N 


D 


Q 


G 


10 


11 


12 



F 
I 
Y 

C V|I G | A N 
15 







I 


N 


L 




Q 


M 


V 




L 


H 


H 




K 


P 


D 


F 


T 


R 


17 


18 


19 



F L 
Y W 



| MNt | tgc | Rtt | gst | NWt | ttt | MNS | cgt [ tDS | 



22 
ttc I 



Y 
23 



N 
24 



Q 
26 



*j 27 ZH 23 30 31 

| tat | aat | aac | cag I Get | aag | caa | tgt | SWg | 

I i r, 



K 
28 



Q 

29 



L I Q 
C E I V 
30 31 



Q 


G 


L 


P 


T 


K 


V 


I 


E 


A 


R 


32 



Espl 



| BarDl\ 



F 
33 



L 
T 
E 
A 

K 
34 



Y 
35 



G 
36 



G 
37 



Q 


P 






T 


K 




R I G 


V 


M 




K E 


E 


A 




L Q 


L 


G | A 


N M| V 


39 


40 


41 42 



| ttc I VHA | tat | ggt | ggt | tgc | VHG | gSt | aat | VBg | 



NNFETLEECK 
43 44 45 46 47 48 49 50 51 52 
| aac j aac [ ttc | gag | act | eta | gaa | gag | tgt | aag | 



| Xbal | 



N 
53 



CEDGGAETVES 
55 56 57 58 100 101 102 103 104 105 106 

^ ? »t f ' 1 gat ! ggt ' ggt ' gCt 1 9 a jl^ act 1 1 gtt | gag | tct | 



DrdI 



| Ndel | 

x 10 10 amino acid sequences; 1.238 x 10 11 DNA sequences 
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Table 790 

in hNE-in 


: Amino acids preferred 
libiting Kunitz domains 




Position 


Allowed amino acids 




5 


C 


5 


10 


YSV, (NA) 




11 


TAR, (QP) 




12 


G 




13 


P, (VALI) 




14 


C 


10 


15 


IV 


u 


16 


AG 




17 


FM, ILV(A) 




18 


F 




19 


PS, QK 


_:;15 


20 


R 




21 


YW, (F) 




30 


C 


P 


31 


QEV, (AL) 


f*S 


32 


TL, (PSA) 


m 20 


33 


F 




34 


VP 




35 


Y 




36 


G 




37 


G 


25 


38 


C 




39 


MQ 




40 


G, A 




41 


N highly preferred 


3 0 


42 
45 


G preferred, A allowed 
F 




51 


C 




55 


C 
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